Predicted distributions of 65 groundfish species in Canadian Pacific waters

Description:

This dataset contains layers of predicted occurrence for 65 groundfish species as well as overall species richness (i.e., the total number of species present) in Canadian Pacific waters, and the median standard error per grid cell across all species. They cover all seafloor habitat depths between 10 and 1400 m that have a mean summer salinity above 28 PSU. Two layers are provided for each species: 1) predicted species occurrence (prob_occur) and 2) the probability that a grid cell is an occurrence hotspot for that species (hotspot_prob; defined as being in the lower of: 1) 0.8, or 2) the 80th percentile of the predicted probability of occurrence values across all grid cells that had a probability of occurrence greater than 0.05.). The first measure provides an overall prediction of the distribution of the species while the second metric identifies areas where that species is most likely to be found, accounting for uncertainty within our model. All layers are provided at a 1 km resolution.

Methods:

These layers were developed using a species distribution model described in Thompson et al. 2023. This model integrates data from three fisheries-independent surveys: the Fisheries and Oceans Canada (DFO) Groundfish Synoptic Bottom Trawl Surveys (Sinclair et al. 2003; Anderson et al. 2019), the DFO Groundfish Hard Bottom Longline Surveys (Lochead and Yamanaka 2006, 2007; Doherty et al. 2019), and the International Pacific Halibut Commission Fisheries Independent Setline Survey (IPHC 2021). Further details on the methods are found in the metadata PDF available with the dataset.

Abstract from Thompson et al. 2023:

Predictions of the distribution of groundfish species are needed to support ongoing marine spatial planning initiatives in Canadian Pacific waters. Data to inform species distribution models are available from several fisheries-independent surveys. However, no single survey covers the entire region and different gear types are required to survey the range of habitats that are occupied by groundfish. Bottom trawl gear is used to sample soft bottom habitat, predominantly on the continental shelf and slope, whereas longline gear often focuses on nearshore and hardbottom habitats where trawling is not possible. Because data from these two gear types are not directly comparable, previous species distribution models in this region have been limited to using data from one survey at a time, restricting their spatial extent and usefulness at a regional scale. Here we demonstrate a method for integrating presence-absence data across surveys and gear types that allows us to predict the coastwide distributions of 66 groundfish species in British Columbia. Our model leverages the use of available data from multiple surveys to estimate how species respond to environmental gradients while accounting for differences in catchability by the different surveys. Overall, we find that this integrated method has two main benefits: 1) it increases the accuracy of predictions in data-limited surveys and regions while having negligible impacts on the accuracy when data are already sufficient to make predictions, 2) it reduces uncertainty, resulting in tighter confidence intervals on predicted species occurrences. These benefits are particularly relevant in areas of our coast where our understanding of habitat suitability is limited due to a lack of spatially comprehensive long-term groundfish research surveys.

Data Sources:

Research data was provided by Pacific Science’s Groundfish Data Unit for research surveys from the GFBio database between 2003 and 2020 for all species which had at least 150 observations, across all gear type and survey datasets available.

Uncertainties:

These are modeled results based on species observations at sea and their related environmental covariate predictions that may not always accurately reflect real-world groundfish distributions though methods that integrate different data types/sources have been demonstrated to improve model inference by increasing the accuracy of the predictions and reducing uncertainty.

Datasets available for download

Additional Info

Field Value
Last Updated October 22, 2024, 15:19 (UTC)
Created October 1, 2024, 06:24 (UTC)
Domain / Topic
Domain or topic of the dataset being cataloged.
Biota
Format (CSV, XLS, TXT, PDF, etc)
File format of the dataset.
Dataset Size
Dataset size in megabytes.
Metadata Identifier
Metadata identifier – can be used as the unique identifier for catalogue entry
Published Date
Published date of the dataset.
2023-02-27
Time Period Data Span (start date)
Start date of the data in the dataset.
Time Period Data Span (end date)
End date of time data in the dataset.
GeoSpatial Area Data Span
A spatial region or named place the dataset covers.
Field Value
Access category
Type of access granted for the dataset (open, closed, service, etc).
Limits on use
Limits on use of data.
Location
Location of the dataset.
Data Service
Data service for accessing a dataset.
Owner
Owner of the dataset.
Fisheries and Oceans Canada | Pêches et Océans Canada
Contact Point
Who to contact regarding access?
Government of Canada; Fisheries and Oceans Canada; Pacific Science, 604-822-8419, Emily.Rubidge@dfo-mpo.gc.ca
Publisher
Publisher of the dataset.
Publisher Email
Email of the publisher.
Patrick.Thompson@dfo-mpo.gc.ca
Accessed At
Date the data and metadata was accessed.
Field Value
Identifier
Unique identifier for the dataset.
Language
Language(s) of the dataset
Link to dataset description
A URL to an external document describing the dataset.
Persistent Identifier
Data is identified by a persistent identifier.
Globally Unique Identifier
Data is identified by a persistent and globally unique identifier.
Contains data about individuals
Does the data hold data about individuals?
Contains data about identifiable individuals
Does the data hold identifiable data about individual?
Contains Indigenous Data
Does the data hold data about Indigenous communities?
Field Value
Source
Source of the dataset.
https://open.canada.ca/data/en/dataset/51c60d88-c6ac-4e1c-9724-83b6048aeccd
Version notes
Version notes about the dataset.
Is version of another dataset
Link to dataset that it is a version of.
Other versions
Link to datasets that are versions of it.
Provenance Text
Provenance Text of the data.
Provenance URL
Provenance URL of the data.
Temporal resolution
Describes how granular the date/time data in the dataset is.
GeoSpatial resolution in meters
Describes how granular (in meters) geospatial data is in the dataset.
GeoSpatial resolution (in regions)
Describes how granular (in regions) geospatial data is in the dataset.
Field Value
Indigenous Community Permission
Who holds the Indigenous Community Permission. Who to contact regarding access to a dataset that has data about Indigenous communities.
Community Permission
Community permission (who gave permission).
The Indigenous communities the dataset is about
Indigenous communities from which data is derived.
Field Value
Number of data rows
If tabular dataset, total number of rows.
Number of data columns
If tabular dataset, total number of unique columns.
Number of data cells
If tabular dataset, total number of cells with data.
Number of data relations
If RDF dataset, total number of triples.
Number of entities
If RDF dataset, total number of entities.
Number of data properties
If RDF dataset, total number of unique properties used by the triples.
Data quality
Describes the quality of the data in the dataset.
Metric for data quality
A metric used to measure the quality of the data, such as missing values or invalid formats.

0 Comments

Please login or register to comment.