Development of a coastal species characterization approach using environmental DNA (eDNA) using the marker COI

Species characterization by environmental DNA (eDNA) is a method that allows the use of DNA released into the environment by organisms from various sources (secretions, faeces, gametes, tissues, etc.). It is a complementary tool to standard sampling methods for the identification of biodiversity. This project provides a list of invertebrates species whose DNA has been detected in water samples collected at 2018 using the marker COI.

The surveys were carried out in the summer of 2018 from August 11 to 14, between Forestville and Godbout (Haute-Côte-Nord). Sampling was carried out between 9-52 meters depth in 40 stations with one sample par station. Two liters of water were filtered through a 1.2 µm fiberglass filter. DNA extractions were performed with the DNeasy Blood and Tissue extraction kit (Qiagen). Negative field, extraction and PCR controls were added at the different stages of the protocol. Libraries at the COI locus were prepared by Genome Quebec and sequenced on an Illumina MiSeq PE250 system. The bioinformatics analysis of the sequences obtained was carried out using an in-house analysis pipeline as reported in Bourret et al. 2022. A first step made it possible to obtain a molecular operational taxonomic unit table (MOTU) using the cutadapt software for the removal of the adapters and the DADA2 R package for the filtration, fusion, chimera removal and data compilation. The MOTUs table was subsequently corrected by taking into account the negative controls, where the number of observations in the latter was removed from the linked samples. Singleton MOTUs have also been removed. Finally, the taxonomic assignments were carried out on the MOTUs using the IDTAXA classifier (present in the DECIPHIER R package) using a training set trained on the COI reference bank for Golf St-Laurent (GSL-rl v1.0, https://github.com/GenomicsMLI-DFO/MLI_GSL-rl) and a threshold of 40. Detections with an “Unreliable due to gaps” category were reported at the genus level only.

The file provided includes generic activity information, including site, station name, date, marker type, assignment types used for taxa identification, and a list of taxa or species. The list of taxa has been verified by a biodiversity expert from the Maurice-Lamontagne Institute.

This project was funded by Fisheries and Oceans Canada's Coastal Environmental Baseline Data Program under the Oceans Protection Plan. This initiative aims to acquire baseline environmental data that contributes to the characterization of significant coastal areas and supports evidence-based assessments and management decisions to preserve marine ecosystems.

Data are also available on SLGO platform : https://doi.org/10.26071/ogsl-cd4c205b-f63b

Datasets available for download

Additional Info

Field Value
Last Updated October 22, 2024, 16:20 (UTC)
Created October 1, 2024, 07:53 (UTC)
Domain / Topic
Domain or topic of the dataset being cataloged.
Environment, Oceans
Format (CSV, XLS, TXT, PDF, etc)
File format of the dataset.
Dataset Size
Dataset size in megabytes.
Metadata Identifier
Metadata identifier – can be used as the unique identifier for catalogue entry
Published Date
Published date of the dataset.
2022-12-02
Time Period Data Span (start date)
Start date of the data in the dataset.
Time Period Data Span (end date)
End date of time data in the dataset.
GeoSpatial Area Data Span
A spatial region or named place the dataset covers.
Field Value
Access category
Type of access granted for the dataset (open, closed, service, etc).
Limits on use
Limits on use of data.
Location
Location of the dataset.
Data Service
Data service for accessing a dataset.
Owner
Owner of the dataset.
Fisheries and Oceans Canada | Pêches et Océans Canada
Contact Point
Who to contact regarding access?
Publisher
Publisher of the dataset.
Publisher Email
Email of the publisher.
yanick.gendreau@dfo-mpo.gc.ca
Accessed At
Date the data and metadata was accessed.
Field Value
Identifier
Unique identifier for the dataset.
Language
Language(s) of the dataset
Link to dataset description
A URL to an external document describing the dataset.
Persistent Identifier
Data is identified by a persistent identifier.
Globally Unique Identifier
Data is identified by a persistent and globally unique identifier.
Contains data about individuals
Does the data hold data about individuals?
Contains data about identifiable individuals
Does the data hold identifiable data about individual?
Contains Indigenous Data
Does the data hold data about Indigenous communities?
Field Value
Source
Source of the dataset.
https://open.canada.ca/data/en/dataset/6319d10d-2ea7-44af-bfde-20e2da053d5a
Version notes
Version notes about the dataset.
Is version of another dataset
Link to dataset that it is a version of.
Other versions
Link to datasets that are versions of it.
Provenance Text
Provenance Text of the data.
Provenance URL
Provenance URL of the data.
Temporal resolution
Describes how granular the date/time data in the dataset is.
GeoSpatial resolution in meters
Describes how granular (in meters) geospatial data is in the dataset.
GeoSpatial resolution (in regions)
Describes how granular (in regions) geospatial data is in the dataset.
Field Value
Indigenous Community Permission
Who holds the Indigenous Community Permission. Who to contact regarding access to a dataset that has data about Indigenous communities.
Community Permission
Community permission (who gave permission).
The Indigenous communities the dataset is about
Indigenous communities from which data is derived.
Field Value
Number of data rows
If tabular dataset, total number of rows.
Number of data columns
If tabular dataset, total number of unique columns.
Number of data cells
If tabular dataset, total number of cells with data.
Number of data relations
If RDF dataset, total number of triples.
Number of entities
If RDF dataset, total number of entities.
Number of data properties
If RDF dataset, total number of unique properties used by the triples.
Data quality
Describes the quality of the data in the dataset.
Metric for data quality
A metric used to measure the quality of the data, such as missing values or invalid formats.

0 Comments

Please login or register to comment.