Air Pollution Index (PCA readme file)

CANUE specialists developed multi-pollutant indices of air quality. Three-year (2012-14) average datasets of trace gases and aerosols derived from satellite-based sensors were prepared on a common 10x10 km grid over North America (NA: Canada and the United States). The datasets include surface PM2.5 (in µg/m3), NO2, SO2, CO and NH3 (all in ppb), and vertical column totals for HCHO (formaldehyde, in molecules per cm2). Surface O3 concentrations from CANUE holdings (warm-season average 8-hour maximum) were merged with these satellite-based data.

Principal Component Analysis (PCA) was used to compute three pollutant mixture indicators for each 10x10 km grid cell. These values were normalized to a 0-100 scale. Each indicator is related to a different aspect of common air pollution mixtures occurring across North America.

Indicator 1 (PC1) reflects combustion-related pollutant mixtures, typically linked to populated areas. PC1 is suggested for use as the primary multi-pollutant, long-term outdoor air pollution exposure index because it incorporates the largest diversity of pollutants: PM2.5, CO, NO2, formaldehyde (HCHO), SO2, NH3 and O3, listed here in order of their PCA loading coefficients (i.e., weights).

Indicator 2 (PC2) is related to O3 and NH3 mixtures common in parts of western NA, linked to higher elevations (i.e., background O3) and livestock grazing.

Indicator 3 (PC3) is related to NH3 mixtures (with some weight from CO and PM2.5) associated with more intensive agricultural practices (e.g., crops, intensive raising of animals).

Also included are the input air quality datasets. CANUE staff indexed the data to DMTI Spatial single-link postal codes.
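As a rough illustration of how PCA-based mixture indicators on a 0-100 scale can be derived, the sketch below standardizes a matrix of pollutant values, extracts three principal components, and min-max rescales the scores. It is not CANUE's actual procedure: the z-score standardization, the min-max rescaling, the function name `pollutant_indices`, and the synthetic input are all assumptions made for illustration. The sign of each component is arbitrary in PCA, so a real workflow would need to orient each component before rescaling.

```python
import numpy as np

def pollutant_indices(X, n_components=3):
    """Return (scores scaled to 0-100, loadings) for a (cells x pollutants) array.

    Hypothetical helper: standardization and min-max scaling are assumptions,
    not the documented CANUE method.
    """
    X = np.asarray(X, dtype=float)
    # Standardize each pollutant so differing units (µg/m3, ppb,
    # molecules per cm2) do not dominate the components.
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    # PCA via singular value decomposition of the standardized matrix.
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    loadings = Vt[:n_components]      # shape: (n_components, n_pollutants)
    scores = Z @ loadings.T           # shape: (n_cells, n_components)
    # Min-max rescale each component's scores to the 0-100 range.
    lo = scores.min(axis=0)
    hi = scores.max(axis=0)
    scaled = 100.0 * (scores - lo) / (hi - lo)
    return scaled, loadings

# Synthetic stand-in for 500 grid cells x 7 pollutants (not real data).
rng = np.random.default_rng(0)
demo = rng.lognormal(size=(500, 7))
indices, loadings = pollutant_indices(demo)
```

Each row of `loadings` holds the weights of the seven input pollutants for one component, which is the quantity the description refers to when it orders the pollutants for PC1.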

Datasets available for download

Additional Info

Field Value
Last Updated April 18, 2024, 16:56 (UTC)
Created September 18, 2023, 23:37 (UTC)
Domain / Topic
Domain or topic of the dataset being cataloged.
Environment
Format (CSV, XLS, TXT, PDF, etc)
File format of the dataset.
.docx - application/vnd.openxmlformats-officedocument.wordprocessingml.document
Dataset Size
Dataset size in megabytes.
Metadata Identifier
Metadata identifier – can be used as the unique identifier for catalogue entry
Published Date
Published date of the dataset.
Time Period Data Span (start date)
Start date of the data in the dataset.
2012-01-01
Time Period Data Span (end date)
End date of the data in the dataset.
2014-12-31
GeoSpatial Area Data Span
A spatial region or named place the dataset covers.
Canada
fair_rda_a1_02d Yes
fair_rda_a1_03d No
fair_rda_a1_04d No
fair_rda_a1_05d No
fair_rda_a1_1_01d Yes
fair_rda_a1_2_01d No
fair_rda_i1_01d No
fair_rda_i1_02d No
fair_rda_i2_01d No
fair_rda_i3_01d Yes
fair_rda_r1_3_01d No
Access category
Type of access granted for the dataset (open, closed, service, etc.).
visible
Limits on use
Limits on use of data.
These data files are provided solely for the purposes stated in the CANUE Data Sharing and Use Agreement and should not be redistributed for any reason. These data also contain proprietary postal code data and may only be used for the project named in the CANUE Data Sharing and Use Agreement. Data can be shared only within a project team for the exclusive purposes of teaching, academic research and publishing, and/or planning of educational services, in accordance with the DMTI End User Agreement associated with the Spatial Mapping Academic Research Tools (SMART) Program.
Location
Location of the dataset.
https://www.canue.ca
Data Service
Data service for accessing a dataset.
Owner
Owner of the dataset.
CANUE (Canadian Urban Environmental Health Research Consortium)|Dalla Lana School of Public Health, University of Toronto
Contact Point
Who to contact regarding access?
Publisher
Publisher of the dataset.
Publisher Email
Email of the publisher.
Author
Author of the dataset.
CANUE (Canadian Urban Environmental Health Research Consortium)|Dalla Lana School of Public Health, University of Toronto
Author Email
Email of the author.
info@canue.ca
Accessed At
Date the data and metadata were accessed.
2023-09-18
Identifier
Unique identifier for the dataset.
43
Language
Language(s) of the dataset
English
Link to dataset description
A URL to an external document describing the dataset.
https://canue.ca/wp-content/uploads/2022/02/Read_Me_PCA.docx
Persistent Identifier
Data is identified by a persistent identifier.
No
Globally Unique Identifier
Data is identified by a persistent and globally unique identifier.
No
Contains data about individuals
Does the data hold data about individuals?
N/A
Contains data about identifiable individuals
Does the data hold identifiable data about individuals?
N/A
Contains Indigenous Data
Does the data hold data about Indigenous communities?
N/A
Version notes
Version notes about the dataset.
Is version of another dataset
Link to dataset that it is a version of.
Other versions
Link to datasets that are versions of it.
Provenance Text
Provenance Text of the data.
Provenance URL
Provenance URL of the data.
Temporal resolution
Describes how granular the date/time data in the dataset is.
NaN
GeoSpatial resolution in meters
Describes how granular (in meters) geospatial data is in the dataset.
GeoSpatial resolution (in regions)
Describes how granular (in regions) geospatial data is in the dataset.
Indigenous Community Permission
Who holds the Indigenous Community Permission. Who to contact regarding access to a dataset that has data about Indigenous communities.
Community Permission
Community permission (who gave permission).
The Indigenous communities the dataset is about
Indigenous communities from which data is derived.
Number of data rows
If tabular dataset, total number of rows.
Number of data columns
If tabular dataset, total number of unique columns.
Number of data cells
If tabular dataset, total number of cells with data.
Number of data relations
If RDF dataset, total number of triples.
Number of entities
If RDF dataset, total number of entities.
Number of data properties
If RDF dataset, total number of unique properties used by the triples.
Data quality
Describes the quality of the data in the dataset.
NoData = -9999 (numeric fields); NoData = null (category fields); data insufficient to calculate value = -1111
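Because the numeric sentinel codes would distort any average or regression if read as real measurements, they should be masked before analysis. The following is a minimal sketch of one way to do that with the Python standard library; the helper name `clean_numeric` and the column names `pc1`, `pc2`, `pc3` and `postalcode` are hypothetical and are not taken from the actual data files.

```python
# Sentinel codes as stated in the "Data quality" field.
NO_DATA = -9999.0        # NoData for numeric fields
INSUFFICIENT = -1111.0   # data insufficient to calculate a value

def clean_numeric(raw):
    """Return the value as a float, or None for empty or sentinel entries."""
    if raw is None or str(raw).strip() == "":
        return None
    value = float(raw)
    if value in (NO_DATA, INSUFFICIENT):
        return None
    return value

# Hypothetical record; the column names are assumptions for illustration.
row = {"postalcode": "A1A1A1", "pc1": "42.5", "pc2": "-9999", "pc3": "-1111"}
cleaned = {key: clean_numeric(row[key]) for key in ("pc1", "pc2", "pc3")}
```

Both sentinels are mapped to `None` here for simplicity; an analysis that needs to distinguish "no data" from "insufficient data" could return separate markers instead.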
Metric for data quality
A metric used to measure the quality of the data, such as missing values or invalid formats.