Statistical variables associated with places (and other entities)
Get the latest value from preferred source for a single variable
As an example, the latest population of CA:
SELECT Obs.Value AS Value
FROM `data_commons.Observation` AS Obs
WHERE Obs.observation_about = 'geoId/06' AND
Obs.variable_measured = 'Count_Person' AND
Obs.is_preferred_obs_across_facets
Get the latest value from specific source for a single variable
As an example, the latest CA population from ACS 5 Year Survey:
SELECT Obs.value AS Value
FROM `data_commons.Observation` AS Obs
WHERE Obs.observation_about = 'geoId/06' AND
Obs.variable_measured = 'Count_Person' AND
Obs.measurement_method = 'CensusACS5yrSurvey'
ORDER BY Obs.observation_date DESC
LIMIT 1
List variables available for a given place
As an example, the variables available for California state:
SELECT DISTINCT Obs.variable_measured AS Var
FROM `data_commons.Observation` AS Obs
WHERE Obs.observation_about = 'geoId/06'
ORDER BY Var;
List sources available for a given place/variable combination
As an example, the sources for count of housing units in California:
SELECT DISTINCT
Prov.name AS Name,
Prov.provenance_url AS URL,
Obs.measurement_method AS MeasurementMethod
FROM `data_commons.Observation` AS Obs
JOIN `data_commons.Provenance` AS Prov ON TRUE
WHERE Obs.observation_about = 'geoId/06' AND
Obs.variable_measured = 'Count_HousingUnit' AND
Prov.id = Obs.prov_id
ORDER BY Name
Get observations from preferred source for a single variable
As an example, CA population over time from preferred source:
SELECT Obs.observation_date AS Date,
Obs.Value AS Value
FROM `data_commons.Observation` AS Obs
WHERE Obs.observation_about = 'geoId/06' AND
Obs.variable_measured = 'Count_Person' AND
Obs.facet_rank = 1
ORDER BY Date
Get the latest value from preferred source for all the places of a given type X contained in place Y
As an example, the unemployment rate in counties of USA:
WITH ChildPlace AS (
SELECT id AS PlaceId FROM `data_commons.Place`
WHERE EXISTS(SELECT * FROM UNNEST(all_types) AS T WHERE T = 'County') AND
EXISTS(SELECT * FROM UNNEST(linked_contained_in_place) AS C WHERE C = 'country/USA')
)
SELECT O.observation_about AS PlaceId,
P.name AS PlaceName,
O.value AS Value,
FROM `data_commons.Observation` AS O
JOIN ChildPlace ON TRUE
JOIN `data_commons.Place` AS P ON TRUE
WHERE O.is_preferred_obs_across_facets AND
O.variable_measured = 'UnemploymentRate_Person' AND
O.observation_about = ChildPlace.PlaceId AND
O.observation_about = P.id
ORDER BY PlaceName