Glossary of Common Terms
This page contains a selection of key terms important to understanding the structure of data within Data Commons.
A group of entities sharing some characteristic. Interchangeably referred to in a Data Commons context as
CohortSet. Examples include the CDC’s list of the United States’ 500 largest cities.
Cohortdocumented at https://datacommons.org/browser/Cohort is a legacy type not used by the Sheets method
The date of measurement. Specified in ISO 8601 format. Examples include
2011 (the year 2011),
2019-06 (the month of June in the year 2019), and
2019-06-05T17:21:00-06:00 (5:17PM on June 5, 2019, in CST).
A unique identifier itemizing and classifying an entity in the Data Commons graph. For example, Austin, Texas, has a DCID of ‘geoId/4805000’, while the plant species Austrobaileya scandens has a DCID of ‘dc/bsmvthtq89217’.
The denominator of a fractional measurement. A complete list of properties can be found at https://datacommons.org/browser/measurementDenominator.
The technique used for measuring a statistical variable. Describes how a measurement is made, whether by count or estimate or some other approach. May name the group making the measurement to indicate a certain organizational method of measurement is used. Examples include the American Community Survey and
WorldHealthOrganizationEstimates. Multiple measurement methods may be specified for any given node. A complete list of properties can be found at https://datacommons.org/browser/measurementMethod.
Property of statistical variables that measure proportions, used in conjunction with the measurementDenominator property to indicate the multiplication factor applied to the proportion’s denominator (with the measurement value as the final result of the multiplication) when the numerator and denominator are not equal.
As an example, in 1999, approximately 36% of Canadians were Internet users. Here the measured value of
Count_Person_IsInternetUser_PerCapita is 36, and the scaling factor or denominator for this per capita measurement is 100. Without the scaling factor, we would interpret the value to be 36/1, or 3600%.
A complete list of properties can be found at https://datacommons.org/browser/scalingFactor.
Any type of metric, statistic, or measure that can be measured at a place and time. Examples include median income of persons older than 16, number of female high school graduates aged 18 to 24, unemployment rate, or percentage of persons with diabetes. A complete list of statistical variables can be found at https://datacommons.org/browser/StatisticalVariable.
A measurement of a
StatisticalVariable for a particular place and time. For example, a
StatVarObservation of the
Median_Income_Person for Brookmont, Maryland, in the year 2018 would be $126,199. A complete list of properties of statistical variable observations can be found at https://datacommons.org/browser/StatVarObservation.
A three-part grouping describing node and edge objects in the Data Commons graph.
Given tabular data such as the following:
|USA||United States of America||northamerica|
You can represent this data as a graph via subject-predicate-object “triples” that describe the node and edge relationships.
USA -- typeOf ------------> Country USA -- name --------------> United States of America USA -- containedInPlace --> northamerica