CSV Bulk Download
Some of our data is available in a relational format as CSV files in a public Google Cloud Storage bucket. The tables are constructed such that each row represents a Place and each column represents a Statistical Variable.
These relational tables are organized by vertical, each within a different zip folder, which can be downloaded from the links below:
Each vertical zip folder contains tables for various Place categories:
all (all places),
us (US places),
non_us (non-US places),
county (US counties), and
zip (US zip codes).
For each vertical and Place category, there are three types of tables:
value: Each cell contains the value of the latest observation for a given Statistical Variable and Place.
date: Each cell contains the date of the latest observation for a given Statistical Variable and Place.
provenance: Each cell contains the provenance URL of the latest observation for a given Statistical Variable and Place, as well as the measurement method, if provided. Measurement methods prefixed with dcAggregate/ represent Data Commons aggregated values.
The table names follow the pattern [vertical]_[place_category]_[type], and each table is sharded into multiple CSV files. For example, the file demographics_all_date-00000-of-00456.csv contains a portion of the latest-observation dates for demographics Statistical Variables and all Places. In this case, the table has been sharded into 456 files.
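The naming pattern can be made concrete with a short Python sketch that splits a shard file name into its parts and lists the sibling shards of the same table. The shard suffix layout (five-digit, zero-based index, then "-of-", then a five-digit count) is an assumption inferred from the single example file name above, and the names ShardName, parse_shard_name, and all_shard_names are hypothetical helpers, not part of any Data Commons tool.

```python
import re
from typing import List, NamedTuple

# Assumed layout, inferred from the one example file name in the text:
#   [vertical]_[place_category]_[type]-[index]-of-[count].csv
# The vertical is matched lazily so that "non_us" is not split into "non" + "us".
SHARD_RE = re.compile(
    r"^(?P<vertical>.+?)"
    r"_(?P<place_category>all|non_us|us|county|zip)"
    r"_(?P<type>value|date|provenance)"
    r"-(?P<index>\d{5})-of-(?P<count>\d{5})\.csv$"
)


class ShardName(NamedTuple):
    vertical: str
    place_category: str
    table_type: str
    index: int
    count: int


def parse_shard_name(filename: str) -> ShardName:
    """Split a shard file name into vertical, Place category, type, and shard numbers."""
    match = SHARD_RE.match(filename)
    if match is None:
        raise ValueError(f"not a recognized shard file name: {filename!r}")
    return ShardName(
        match["vertical"],
        match["place_category"],
        match["type"],
        int(match["index"]),
        int(match["count"]),
    )


def all_shard_names(shard: ShardName) -> List[str]:
    """List every shard file name of the table, assuming indices run from 0 to count - 1."""
    table = f"{shard.vertical}_{shard.place_category}_{shard.table_type}"
    return [f"{table}-{i:05d}-of-{shard.count:05d}.csv" for i in range(shard.count)]
```

Parsing demographics_all_date-00000-of-00456.csv this way yields the vertical demographics, the Place category all, the type date, and a shard count of 456.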
The value, date, and provenance tables can be joined using the first three columns, which contain information about the place:
place_name: The name(s) of the Place.
place_dcid: The Data Commons ID for the Place.
place_type: The type(s) of the Place.
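A join on these three columns might look like the following Python sketch, which uses only the standard library. It assumes that each CSV file starts with a header row, that the key columns are literally named place_name, place_dcid, and place_type, and that every other column is a Statistical Variable. The helper names index_by_place and join_tables are hypothetical, and the demo rows are placeholders made up for illustration, not real Data Commons data.

```python
import csv
import io
from typing import Dict, TextIO, Tuple

# Assumed header names for the three place columns described above.
KEY_COLUMNS = ("place_name", "place_dcid", "place_type")

PlaceKey = Tuple[str, ...]


def index_by_place(csv_file: TextIO) -> Dict[PlaceKey, Dict[str, str]]:
    """Map each row's (place_name, place_dcid, place_type) key to its remaining cells."""
    rows: Dict[PlaceKey, Dict[str, str]] = {}
    for row in csv.DictReader(csv_file):
        key = tuple(row[col] for col in KEY_COLUMNS)
        rows[key] = {k: v for k, v in row.items() if k not in KEY_COLUMNS}
    return rows


def join_tables(
    value_file: TextIO, date_file: TextIO, provenance_file: TextIO
) -> Dict[PlaceKey, Dict[str, Dict[str, str]]]:
    """Combine value, date, and provenance cells per Place and Statistical Variable."""
    values = index_by_place(value_file)
    dates = index_by_place(date_file)
    provenances = index_by_place(provenance_file)
    joined: Dict[PlaceKey, Dict[str, Dict[str, str]]] = {}
    for key, value_cells in values.items():
        date_cells = dates.get(key, {})
        provenance_cells = provenances.get(key, {})
        joined[key] = {
            variable: {
                "value": cell,
                "date": date_cells.get(variable, ""),  # empty if no matching cell
                "provenance": provenance_cells.get(variable, ""),
            }
            for variable, cell in value_cells.items()
        }
    return joined


# Placeholder rows, made up for illustration only.
header = "place_name,place_dcid,place_type,Example_Variable\n"
value_csv = io.StringIO(header + "Example Place,example/1,City,1\n")
date_csv = io.StringIO(header + "Example Place,example/1,City,2020\n")
provenance_csv = io.StringIO(header + "Example Place,example/1,City,https://example.com\n")

joined = join_tables(value_csv, date_csv, provenance_csv)
print(joined[("Example Place", "example/1", "City")])
```

The sketch joins whatever rows appear in the three files it is given. Whether same-numbered shards of the value, date, and provenance tables cover the same Places is not stated here, so it may be necessary to read all shards of each table before joining.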
Example Table Structure
Below is a subset of the
And the corresponding subset of the
And for the