This spreadsheet is intended to function as a data dictionary providing metadata for tables contained in the All of Us Research Program data model. This data model incorporates the standard OMOP tables in addition to several custom tables used to support the Program's non-EHR data types and its research tools. The data dictionary documents all available fields and details tables, fields, and concepts which have been generalized or suppressed in accordance with All of Us privacy rules. A brief description of each tab is given below.
Visit the OMOP CDM Wiki for more information on the OMOP model
Note that this spreadsheet is not intended to document every concept available within the All of Us data model, only those altered by the Program's privacy rules. For a searchable database of available concepts with metadata, visit the OHDSI Athena tool at http://athena.ohdsi.org (link below).
Visit the OHDSI Athena website for concept-level information
Change Log: Details changes made to this document AFTER the initial release of this CDR version.
OMOP-Compatible Tables: Lists all OMOP-Compatible fields present within the Program data model with relevant metadata, including a description, the data provenance, and whether the field was impacted by privacy methods.
Socioeconomic Status: Details appended table containing socioeconomic status information by 3-digit zip code. Built to privacy methodology specifications, so no additional privacy rules apply.
Wearables: Contains information on Wearables data from Fitbit, including any privacy methodology applied - these are non-OMOP formatted tables appended to the CDR.
Serology: Contains serology data information, including any privacy methodology applied - these are non-OMOP formatted tables appended to the CDR.
Genomics: Contains brief description of genomics information available - data are in a unique format, but can be linked to other participant data in the CDR via the research_id.
Table Suppressions: Details all tables which are suppressed (not available) in the data model.
Field Suppressions: Details all fields (columns) which are suppressed (set to null) in the data model with relevant metadata, including a description and the data provenance.
Concept Suppressions: Details all concepts (rows) which are suppressed (removed) from the data model with relevant metadata, including the Concept ID and the data provenance.
Field Generalizations: Details all fields (columns) which are generalized in the data model with relevant metadata, including a field description, the data provenance, a description of the generalization applied, and the expected generalization output.
Concept Generalizations: Details all concepts (rows) which are generalized in the data model with relevant metadata, including the Concept ID, the data provenance, a description of the generalization applied, and the expected generalization output.
Cleaning & Conformance: Details cleaning and conformance rules run on CDR_base and between CDR_base and CDR so that the data adhere to cleanliness norms and expectations.
Program Custom Concept IDs: Enumerates concept IDs created specifically for program-specific needs, such as a generalized value.
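
The tab descriptions above distinguish three privacy operations: field suppression (a column is set to null), concept suppression (rows carrying a given concept are removed), and generalization (a value is replaced by a coarser one). The sketch below illustrates only that distinction on toy data. It is not the Program's actual pipeline: the function names, the sample rows, the concept IDs 1001 and 1002, the choice of `provider_id` as the suppressed field, and the zip-code truncation rule are all hypothetical, chosen for illustration.

```python
from typing import Any, Callable, Dict, List, Set

Row = Dict[str, Any]


def suppress_fields(rows: List[Row], fields: Set[str]) -> List[Row]:
    """Field suppression: keep every row, but set the listed columns to None (null)."""
    return [{k: (None if k in fields else v) for k, v in row.items()} for row in rows]


def suppress_concepts(rows: List[Row], concept_field: str, concept_ids: Set[int]) -> List[Row]:
    """Concept suppression: remove every row whose concept ID is in the suppressed set."""
    return [dict(row) for row in rows if row[concept_field] not in concept_ids]


def generalize_field(rows: List[Row], field: str, generalize: Callable[[Any], Any]) -> List[Row]:
    """Field generalization: replace each value in one column with a coarser value."""
    return [{**row, field: generalize(row[field])} for row in rows]


# Hypothetical rows; none of these values come from the data dictionary.
rows = [
    {"research_id": 1, "concept_id": 1001, "provider_id": 77, "zip_code": "02139"},
    {"research_id": 2, "concept_id": 1002, "provider_id": 78, "zip_code": "94110"},
]

# Assumed order of operations, for illustration only.
nulled = suppress_fields(rows, {"provider_id"})            # column kept, values set to None
filtered = suppress_concepts(nulled, "concept_id", {1002})  # matching rows dropped
result = generalize_field(filtered, "zip_code", lambda z: z[:3] + "**")

print(result)
```

Note the practical difference for analysis: a suppressed field still appears in the table with null values, whereas a suppressed concept leaves no row behind at all, so row counts change only in the second case.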