Development of a coastal species characterization approach using environmental DNA (eDNA) using the marker Mifish (12S)

Species characterization by environmental DNA (eDNA) is a method that allows the use of DNA released into the environment by organisms from various sources (secretions, faeces, gametes, tissues, etc.). It is a complementary tool to standard sampling methods for the identification of biodiversity. This project provides a list of fish and marine mammal species whose DNA has been detected in water samples collected between 2019 and 2021 using the mitochondrial marker MiFish (12S).

The surveys were carried out in the summer of 2019 (July 14-18) and (July 30 - August 5), in the fall of 2020 (October 27-28) and in the summer-fall of 2021 (May 31 - June 3 ) and (August 24-25) between Forestville and Godbout (Haute-Côte-Nord). Sampling was carried out between 1-50 meters depth in 91 stations, with 1 to 3 replicates per station. Two liters of water were filtered through a 1.2 µm fiberglass filter. DNA extractions were performed with the DNeasy Blood and Tissues or PowerWater extraction kit (Qiagen). Negative field, extraction and PCR controls were added at the different stages of the protocol. The libraries were prepared either by Génome Québec (2019, 2020) or by the Genomics Laboratory of the Maurice-Lamontagne Institute (2021), then sequenced on a NovaSeq 4000 PE250 system by Génome Québec. The bioinformatics analysis of the sequences obtained was carried out using an analysis pipeline developed in the genomics laboratory. A first step made it possible to obtain a table of molecular operational taxonomic units (MOTU) using the cutadapt software for the removal of the adapters and the R package DADA2 for the filtration, the fusion, removal of chimeras and compilation of data. The MOTUs table was then corrected using the R package metabaR to eliminate the tag-jumping and take contaminants into consideration. Samples showing a strong presence of contaminating MOTUs were removed from the dataset. The MOTUs were also filtered to remove all remaining adapter sequences and also retain only those of the expected size (around 170 bp). Finally, taxonomic assignments were made on the MOTUs using the BLAST+ program and the NCBI-nt database. Taxonomic levels (species, genus or family) were assigned using a best match method (Top hit), with a threshold of 95%. Only assignments at the level of fish and marine mammals were considered, and the taxa detected were compared to a list of regional species, and corrected if necessary. The species detections of the different replicas have been combined.

The file provided includes generic activity information, including site, station name, date, marker type, assignment types used for taxa identification, and a list of taxa or species. The list of taxa has been verified by a biodiversity expert from the Maurice-Lamontagne Institute.

This project was funded by Fisheries and Oceans Canada's Coastal Environmental Baseline Data Program under the Oceans Protection Plan. This initiative aims to acquire baseline environmental data that contributes to the characterization of significant coastal areas and supports evidence-based assessments and management decisions to preserve marine ecosystems.

Data were also published on SLGO platform : https://doi.org/10.26071/ogsl-2239bca5-c24a

Datasets available for download

Additional Info

Field Value
Last Updated October 22, 2024, 16:19 (UTC)
Created October 1, 2024, 07:53 (UTC)
Domain / Topic
Domain or topic of the dataset being cataloged.
Biota, Environment, Oceans
Format (CSV, XLS, TXT, PDF, etc)
File format of the dataset.
Dataset Size
Dataset size in megabytes.
Metadata Identifier
Metadata identifier – can be used as the unique identifier for catalogue entry
Published Date
Published date of the dataset.
2022-12-02
Time Period Data Span (start date)
Start date of the data in the dataset.
Time Period Data Span (end date)
End date of time data in the dataset.
GeoSpatial Area Data Span
A spatial region or named place the dataset covers.
Field Value
Access category
Type of access granted for the dataset (open, closed, service, etc).
Limits on use
Limits on use of data.
Location
Location of the dataset.
Data Service
Data service for accessing a dataset.
Owner
Owner of the dataset.
Fisheries and Oceans Canada | Pêches et Océans Canada
Contact Point
Who to contact regarding access?
Government of Canada;Fisheries and Oceans Canada;Demersal and Benthic Science Branch, Government of Canada;Fisheries and Oceans Canada;Demersal and Benthic Science Branch, 418-775-0647, 418-775-0647, audrey.bourret@dfo-mpo.gc.ca, Sandra.Velasquez@dfo-mpo.gc.ca
Publisher
Publisher of the dataset.
Publisher Email
Email of the publisher.
yanick.gendreau@dfo-mpo.gc.ca
Accessed At
Date the data and metadata was accessed.
Field Value
Identifier
Unique identifier for the dataset.
Language
Language(s) of the dataset
Link to dataset description
A URL to an external document describing the dataset.
Persistent Identifier
Data is identified by a persistent identifier.
Globally Unique Identifier
Data is identified by a persistent and globally unique identifier.
Contains data about individuals
Does the data hold data about individuals?
Contains data about identifiable individuals
Does the data hold identifiable data about individual?
Contains Indigenous Data
Does the data hold data about Indigenous communities?
Field Value
Source
Source of the dataset.
https://open.canada.ca/data/en/dataset/f37aaeca-717d-4f13-bc42-bad029dcc9cb
Version notes
Version notes about the dataset.
Is version of another dataset
Link to dataset that it is a version of.
Other versions
Link to datasets that are versions of it.
Provenance Text
Provenance Text of the data.
Provenance URL
Provenance URL of the data.
Temporal resolution
Describes how granular the date/time data in the dataset is.
GeoSpatial resolution in meters
Describes how granular (in meters) geospatial data is in the dataset.
GeoSpatial resolution (in regions)
Describes how granular (in regions) geospatial data is in the dataset.
Field Value
Indigenous Community Permission
Who holds the Indigenous Community Permission. Who to contact regarding access to a dataset that has data about Indigenous communities.
Community Permission
Community permission (who gave permission).
The Indigenous communities the dataset is about
Indigenous communities from which data is derived.
Field Value
Number of data rows
If tabular dataset, total number of rows.
Number of data columns
If tabular dataset, total number of unique columns.
Number of data cells
If tabular dataset, total number of cells with data.
Number of data relations
If RDF dataset, total number of triples.
Number of entities
If RDF dataset, total number of entities.
Number of data properties
If RDF dataset, total number of unique properties used by the triples.
Data quality
Describes the quality of the data in the dataset.
Metric for data quality
A metric used to measure the quality of the data, such as missing values or invalid formats.

0 Comments

Please login or register to comment.