Open Data Portal Catalogue

The open data portal catalogue is a downloadable dataset containing some key metadata for the general datasets available on the Government of Canada's Open Data portal.

Resource 1 is generated using the ckanapi tool (external link)

Resources 2 - 8 are generated using the Flatterer (external link) utility.

Description of resources:

  1. Dataset is a JSON Lines (external link) file where the metadata of each Dataset/Open Information Record is one line of JSON. The file is compressed with GZip. The file is heavily nested and recommended for users familiar with working with nested JSON.

  2. Catalogue is a XLSX workbook where the nested metadata of each Dataset/Open Information Record is flattened into worksheets for each type of metadata.

  3. datasets metadata contains metadata at the dataset level. This is also referred to as the package in some CKAN documentation. This is the main table/worksheet in the SQLite database and XLSX output.

  4. Resources Metadata contains the metadata for the resources contained within each dataset.

  5. resource views metadata contains the metadata for the views applied to each resource, if a resource has a view configured.

  6. datastore fields metadata contains the DataStore information for CSV datasets that have been loaded into the DataStore. This information is displayed in the Data Dictionary for DataStore enabled CSVs.

  7. Data Package Fields contains a description of the fields available in each of the tables within the Catalogue, as well as the count of the number of records each table contains.

  8. data package entity relation diagram Displays the title and format for column, in each table in the Data Package in the form of a ERD Diagram. The Data Package resource offers a text based version.

  9. SQLite Database is a .db database, similar in structure to Catalogue. This can be queried with database or analytical software tools for doing analysis.

Datasets available for download

Additional Info

Field Value
Last Updated October 22, 2024, 15:10 (UTC)
Created October 1, 2024, 07:11 (UTC)
Domain / Topic
Domain or topic of the dataset being cataloged.
Format (CSV, XLS, TXT, PDF, etc)
File format of the dataset.
Dataset Size
Dataset size in megabytes.
Metadata Identifier
Metadata identifier – can be used as the unique identifier for catalogue entry
Published Date
Published date of the dataset.
2020-01-01
Time Period Data Span (start date)
Start date of the data in the dataset.
Time Period Data Span (end date)
End date of time data in the dataset.
GeoSpatial Area Data Span
A spatial region or named place the dataset covers.
Field Value
Access category
Type of access granted for the dataset (open, closed, service, etc).
Limits on use
Limits on use of data.
Location
Location of the dataset.
Data Service
Data service for accessing a dataset.
Owner
Owner of the dataset.
Treasury Board of Canada Secretariat | Secrétariat du Conseil du Trésor du Canada
Contact Point
Who to contact regarding access?
Publisher
Publisher of the dataset.
Publisher Email
Email of the publisher.
open-ouvert@tbs-sct.gc.ca
Accessed At
Date the data and metadata was accessed.
Field Value
Identifier
Unique identifier for the dataset.
Language
Language(s) of the dataset
Link to dataset description
A URL to an external document describing the dataset.
Persistent Identifier
Data is identified by a persistent identifier.
Globally Unique Identifier
Data is identified by a persistent and globally unique identifier.
Contains data about individuals
Does the data hold data about individuals?
Contains data about identifiable individuals
Does the data hold identifiable data about individual?
Contains Indigenous Data
Does the data hold data about Indigenous communities?
Field Value
Source
Source of the dataset.
https://open.canada.ca/data/en/dataset/c4c5c7f1-bfa6-4ff6-b4a0-c164cb2060f7
Version notes
Version notes about the dataset.
Is version of another dataset
Link to dataset that it is a version of.
Other versions
Link to datasets that are versions of it.
Provenance Text
Provenance Text of the data.
Provenance URL
Provenance URL of the data.
Temporal resolution
Describes how granular the date/time data in the dataset is.
GeoSpatial resolution in meters
Describes how granular (in meters) geospatial data is in the dataset.
GeoSpatial resolution (in regions)
Describes how granular (in regions) geospatial data is in the dataset.
Field Value
Indigenous Community Permission
Who holds the Indigenous Community Permission. Who to contact regarding access to a dataset that has data about Indigenous communities.
Community Permission
Community permission (who gave permission).
The Indigenous communities the dataset is about
Indigenous communities from which data is derived.
Field Value
Number of data rows
If tabular dataset, total number of rows.
Number of data columns
If tabular dataset, total number of unique columns.
Number of data cells
If tabular dataset, total number of cells with data.
Number of data relations
If RDF dataset, total number of triples.
Number of entities
If RDF dataset, total number of entities.
Number of data properties
If RDF dataset, total number of unique properties used by the triples.
Data quality
Describes the quality of the data in the dataset.
Metric for data quality
A metric used to measure the quality of the data, such as missing values or invalid formats.

0 Comments

Please login or register to comment.