Welcome to Data for Canada
Mission
Data for Canada exists to bridge the gap between open data availability and data usability. We curate, clean, and re-engineer high-value Canadian datasets into high-performance, analysis-ready formats for researchers, developers, and systems.
The Problem
Canada creates incredible amounts of open data, from foundational road networks to federal census statistics. However, these datasets are often locked in legacy formats, fragmented portals, or structures that require significant engineering effort to normalize before they can be used. For a researcher or a system developer, the “time-to-insight” is often bottlenecked by data preparation.
The Solution
We act as the transformation layer. We aggregate datasets with permissive licenses and process them into “digestible” standards optimized for modern downstream applications.
- For Researchers: Skip the cleaning phase. Access normalized, documented data ready for analysis.
- For Systems: Standardized data structures designed to feed directly into pipelines, data warehouses, and downstream services.
High-Level Overview
flowchart TD
%% Define Styles
classDef source fill:#e1f5fe,stroke:#01579b,stroke-width:2px
classDef process fill:#fff9c4,stroke:#fbc02d,stroke-width:2px
classDef storage fill:#e0f2f1,stroke:#00695c,stroke-width:2px
classDef consumer fill:#f3e5f5,stroke:#8e24aa,stroke-width:2px
subgraph "Data Sources"
StatProducts[("Statistical Products")]:::source
Orthoimagery[("Orthoimagery")]:::source
end
subgraph "Processing Pipeline"
Raw[Raw Data Ingestion<br/>CSVs, Shapefiles, ECW]:::process
Transform[Transformation Engine]:::process
Opt[Optimization]:::process
end
subgraph "Dissemination Formats"
Parquet[("Parquet")]:::storage
FlatGeoBuf[("FlatGeoBuf")]:::storage
PMTiles[("PMTile")]:::storage
COG[("COG")]:::storage
end
subgraph "Distribution Infrastructure"
ObjectStorage[Object Storage]:::storage
DecentralizedDistribution[Decentralized Distribution]:::storage
Serverless[API & Static Files]:::storage
Metadata[Metadata]:::storage
end
subgraph "Experimental Infrastructure"
GeoServer
QGISServer[("QGIS Server")]
Martin
ZOOProject[("ZOO Project")]
BBOXServer[("BBOX Server")]
end
subgraph "Consumption / End Users"
DataSci[Researchers]:::consumer
WebApps[Web Applications]:::consumer
Systems[Systems]:::consumer
end
%% Relationships
StatProducts --> Raw
Raw --> Transform
Transform --> Opt
Opt --> Parquet
Opt --> FlatGeoBuf
Opt --> PMTiles
Opt --> COG
Parquet --> ObjectStorage
FlatGeoBuf --> ObjectStorage
PMTiles --> ObjectStorage
COG --> ObjectStorage
ObjectStorage --> Metadata
Metadata --> DecentralizedDistribution
Metadata --> Serverless
Metadata --> GeoServer
Metadata --> QGISServer
Metadata --> BBOXServer
Metadata --> Martin
Metadata --> ZOOProject
Metadata --> DataSci
Metadata --> WebApps
Metadata --> Systems
DecentralizedDistribution --> Systems
Serverless --> WebApps
Serverless --> DataSci
ObjectStorage --> GeoServer
ObjectStorage --> QGISServer
ObjectStorage --> BBOXServer
ObjectStorage --> Martin
ObjectStorage --> ZOOProject
GeoServer --> WebApps
QGISServer --> WebApps
BBOXServer --> WebApps
Martin --> WebApps
ZOOProject --> WebApps
GeoServer --> DataSci
QGISServer --> DataSci
BBOXServer --> DataSci
Martin --> DataSci
ZOOProject --> DataSci
click Parquet "https://github.com/apache/parquet-format" _blank
click FlatGeoBuf "https://flatgeobuf.org" _blank
click PMTiles "https://github.com/protomaps/PMTiles/blob/main/spec/v3/spec.md" _blank
click StatProducts "https://www.dataforcanada.org/docs/processes/statistical_products/" _blank
click Orthoimagery "https://www.dataforcanada.org/docs/processes/orthoimagery/" _blank
click DecentralizedDistribution "https://www.dataforcanada.org/docs/dissemination/" _blank
click Metadata "https://stac-utils.github.io/stac-geoparquet/latest/spec/stac-geoparquet-spec/" _blank
click Martin "https://martin.maplibre.org/" _blank
click GeoServer "https://geoserver.org/" _blank
click COG "https://cogeo.org/" _blank
click ZOOProject "https://zoo-project.org/" _blank
click BBOXServer "https://www.bbox.earth/" _blank