
Restricted-Use Dataset Descriptions & Codebooks

Restricted-use, contractual data are available in the following datasets. Click a dataset title to access the associated codebook(s). Note: the codebooks are compressed (zipped) files. To view the PDF files, you will need to extract them with compression software such as 7-Zip (available as a free download).

For more information about obtaining these data, please read About Restricted-Use Data.

 

CORE FILES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Wave I In-Home Interview Data | A merged file containing the Wave I In-Home Interview data, the Parent Questionnaire data (when available), the In-School Questionnaire data (when available), and the Add Health Picture Vocabulary data (when available), collected in 1994–1995. | 27021-0001 | 2,820 | 20,745 |
| Wave II In-Home Interview Data | Data collected during the 1996 in-home interview. | 27021-0002 | 2,540 | 14,738 |
| Wave III In-Home Interview Data | Respondent-level data collected during the 2001–2002 in-home interview, including field interviewer characteristics and the AHPVT. | 27021-0003 through 27021-0011 | 1,876 | 15,197 |
| Wave IV In-Home Interview Data | Data collected during the 2008 in-home interview. | 27021-0012 through 27021-0018 | 974 | 15,701 |
| Wave I and II School Administrator Questionnaires | Information from the Wave I (self-administered) and Wave II (phone-administered) questionnaires answered by administrators at the sampled schools. | 27021-0020 through 27021-0021 | WI: 189; WII: 100 | WI: 172; WII: 128 |
| School Information Data | Additional information about the individual schools. | 27021-0022 | 20 | 172 |
| Wave I In-School Questionnaire Data | Adolescent responses to the In-School Questionnaire administered September 1994 through April 1995. | 27021-0019 | 227 | 90,118 |
| Wave I In-Home Weights | Variables needed to correct for design effects and weight the Wave I in-home data. | 27021-0025 | 4 | 20,745 |
| Wave II In-Home Weights | Variables needed to correct for design effects and weight the Wave II in-home data. | 27021-0026 | 4 | 14,738 |
| Wave III Grand Sample Weights | Variables needed to correct for design effects and weight the Wave III in-home data, including longitudinal and cross-sectional weights. | 27021-0027 | 5 | 15,197 |
| Wave IV Grand Sample Weights | Variables needed to correct for design effects and weight the Wave IV in-home data, including longitudinal and cross-sectional weights. | 27021-0028 | 5 | 15,701 |
| School Administrator Weights | Variables needed to correct for design effects and weight the Wave I school administrator data. | 27021-0023 | 3 | 164 |
| In-School Weights | Variables needed to correct for design effects and weight the Wave I in-school data. | 27021-0024 | 7 | 90,118 |
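Because the weights are distributed separately from the interview data, using them means joining the two files on a shared respondent identifier. The sketch below illustrates that join and a weighted mean on toy records. It assumes the identifier is called AID in both files, and the `score` and `weight` column names are hypothetical; confirm the actual variable names in the codebooks. It covers only the weighting step, not the design-effect correction, which requires survey-analysis software.

```python
def merge_on_id(interview_rows, weight_rows, id_key="AID"):
    """Inner-join two lists of dict records on a shared respondent ID."""
    weights_by_id = {row[id_key]: row for row in weight_rows}
    merged = []
    for row in interview_rows:
        match = weights_by_id.get(row[id_key])
        if match is not None:
            # Keep only respondents present in both files.
            merged.append({**row, **match})
    return merged


def weighted_mean(rows, value_key, weight_key):
    """Weighted mean of value_key, skipping rows with a missing value."""
    usable = [r for r in rows if r.get(value_key) is not None]
    total_weight = sum(r[weight_key] for r in usable)
    if total_weight == 0:
        raise ValueError("no usable rows with nonzero total weight")
    return sum(r[value_key] * r[weight_key] for r in usable) / total_weight


# Toy records: every ID, variable name, and value below is made up.
interviews = [
    {"AID": "A1", "score": 10.0},
    {"AID": "A2", "score": 20.0},
    {"AID": "A3", "score": 30.0},  # no weight record, so dropped by the join
]
weights = [
    {"AID": "A1", "weight": 1.0},
    {"AID": "A2", "weight": 3.0},
]

merged = merge_on_id(interviews, weights)
estimate = weighted_mean(merged, "score", "weight")
```

An inner join is used here so that respondents without a weight are excluded from the estimate rather than silently treated as weight zero.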

 

 

 

 

FRIEND FILES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Wave I In-School Friendship Nominations | Identification numbers of the friends that the respondent nominated during the in-school interview. | 27022-0003 | 11 | 90,118 |
| Wave I In-Home Friendship Nominations | Identification numbers of the friends that the respondent nominated during the Wave I in-home interview. | 27022-0001 | 12 | 20,745 |
| Wave II In-Home Friendship Nominations | Identification numbers of the friends that the respondent nominated during the Wave II in-home interview. | 27022-0002 | 11 | 14,738 |
| Wave III Friend IDs | Add Health respondents who were in the 7th or 8th grade at Wave I were asked at Wave III to identify, from a list of 10 computer-generated names, which ones were current friends or which ones were their friends when they were in school together. This dataset contains the altered identification numbers (AIDs) of the 10 computer-generated names. | 27022-0004 | 11 | 3,572 |

 

 

SIBLING FILES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Adolescent Pairs Data | Information that links and describes the sibling pairs. | 27023-0001 | 11 | 3,139 |
| Wave III Sibling IDs | At Wave III, Add Health respondents were asked questions about their siblings who also participated in the Wave I or II in-home interviews. This dataset contains the AIDs for these siblings. | 27023-0002 | 10 | 4,368 |

 

 

CONTEXTUAL FILES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Wave I Contextual Data | Community contextual variables based on state, county, tract, and block group levels derived from the Wave I addresses. | 27024-0001 | 2,682 | 20,745 |
| Wave II Contextual Data | Community contextual variables based on state, county, tract, and block group levels derived from the Wave II addresses. | 27024-0002 | 2,682 | 14,738 |
| Wave III Contextual Data | Community contextual variables based on state, county, tract, and block group levels derived from the Wave III addresses. | 27024-0003 | 594 | 15,197 |
| Wave I Spatial Analysis Data | X, Y coordinates that can be used to calculate distances between friends in a school community. | 27024-0004 | 4 | 20,301 |
| Wave I Neighborhood Data | Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave I addresses). | 27024-0005 | 5 | 20,745 |
| Wave II Neighborhood Data | Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave II addresses). | 27024-0006 | 5 | 14,738 |
| Wave III Grouping File Data | Pseudo state, county, tract, and block group variables in FIPS code format that allow grouping of Add Health respondents geographically (based on Wave III addresses). | 27024-0007 | 2 | 15,197 |
| Wave III Region | This file contains the Census region codes for the respondents’ Wave III residential locations. | 27024-0008 | 2 | 15,197 |
| Wave IV Region | This file contains the Census region codes for the respondents’ Wave IV residential locations. | 27024-0009 | 2 | 15,197 |
| Wave IV Grouping Data | The pseudo FIPS codes in this file allow you to geographically group respondents by their Wave IV locations. | 27024-0010 | 3 | 15,701 |
| Wave III Supplemental Tract-Level Contextual Data | This file contains supplemental Wave III contextual data that include transportation and commuting measures, climate descriptors, amenities, and state-level tobacco control influences. These variables are available at the census tract level unless otherwise specified. | 27024-0011 | 35 | 15,197 |
| Wave IV Supplemental Tract-Level Contextual Data | This file contains tract-level measures, based on the Wave IV respondent locations, reported by the U.S. Census Bureau's 2009 American Community Survey (ACS), the Climate Atlas of the United States, the USDA Economic Research Service, Esri Data and Maps, ImpacTeen Tobacco Control Policy and Prevalence Data, and the Uniform Crime Reports. When tract-level measures were not available or appropriate, state- and county-level variables were used. | 27024-0012 | 304 | 15,701 |
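The Wave I Spatial Analysis Data are described as X, Y coordinates for calculating distances between friends. A minimal sketch of that calculation follows, assuming the coordinates are planar so that straight-line (Euclidean) distance applies; the IDs, coordinate values, and the `coords` and `nominations` structures are made up for illustration, and the units of the real coordinates should be checked in the codebook.

```python
import math


def euclidean_distance(point_a, point_b):
    """Straight-line distance between two (x, y) points."""
    return math.hypot(point_b[0] - point_a[0], point_b[1] - point_a[1])


def friend_distances(coords, nominations):
    """Distance for each (respondent, friend) pair with known coordinates."""
    distances = {}
    for ego, friend in nominations:
        # Skip pairs where either person has no coordinate record.
        if ego in coords and friend in coords:
            distances[(ego, friend)] = euclidean_distance(
                coords[ego], coords[friend]
            )
    return distances


# Toy data: hypothetical IDs and coordinates, not real Add Health values.
coords = {"A1": (0.0, 0.0), "A2": (3.0, 4.0), "A3": (6.0, 8.0)}
nominations = [("A1", "A2"), ("A1", "A3"), ("A2", "A9")]

distances = friend_distances(coords, nominations)
```

Pairs with a missing coordinate are skipped rather than assigned a placeholder distance, which matters here because the spatial file has fewer observations (20,301) than the Wave I in-home file (20,745).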

 

 

WAVE III SUPPLEMENTAL FILES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Wave III ASHA Call Data | To receive the results of their STD assays, Wave III respondents called a dedicated Add Health number at the American Social Health Association. This dataset provides information on who called the results hotline and the date and time of the call. | 27025-0001 | 75 | 4,279 |
| Wave III BEM Scores Data | The masculinity and femininity raw and standard scores from the 30-item short form of the BEM Sex-Role Inventory are available in this file. | 27025-0002 | 5 | 15,197 |
| Wave III Cotinine Assays Data | This file contains the cotinine and 3-hydroxycotinine assay values for 963 Wave III respondents. | 27025-0003 | 5 | 963 |
| Wave III HPV-MGEN Assays Data | Assay results for human papillomavirus and Mycoplasma genitalium are available for a subset of the Wave III respondents who provided a urine sample. | 27025-0004 | 46 | 5,126 |
| Wave III Mentor Data | For Wave III respondents who reported having a mentor, the open-ended responses to the question “How did {HE/SHE} help you?” have been coded and are available in this file. | 27025-0005 | 17 | 15,197 |
| Wave III Urinalysis Data | This file contains nitrate, specific gravity, pH level, white blood cells, protein, glucose, ketone, urobilinogen, bilirubin, microalbumin, urine creatinine, and blood values from the Wave III urine specimens. | 27025-0006 | 16 | 15,197 |
| Wave III HPV-MGEN Assay Weights | Sample weights for respondents with HPV and MGEN assay results are in this file. | 27025-0007 | 5 | 14,322 |

 

 

WEIGHT COMPONENTS

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Wave I In-Home Weight Components | A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave I. | 27026-0001 | 5 | 20,745 |
| Wave II In-Home Weight Components | A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave II. | 27026-0002 | 5 | 14,738 |
| Wave III In-Home Weight Components | A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave III. | 27026-0003 | 6 | 15,197 |
| Wave IV Weight Components | A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave IV. | 27026-0006 | 6 | 15,701 |
| In-School Weight Components | A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights. | 27026-0004 | 8 | 90,118 |
| Add Health School Weights | The initial weights for the school are in this file. | 27026-0005 | 2 | 132 |

 

 

EDUCATION FILES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | FILES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Academic Courses | These files contain academic status and/or performance indicators for math, science, foreign language, English, history, social sciences, physical education, and a combined overall category. | 27030-0001 through 27030-0006 | 6 | 12,237 |
| Academic Networks | The Network files provide information on social networks based on the respondents’ course-taking patterns. | 27030-0007 through 27030-0009 | 3 | varies by file |
| Context | School-level contextual data are from the Common Core of Data (CCD), the Private School Survey (PSS), the 1990 and 2000 Census, and the Office for Civil Rights. | 27030-0010 through 27030-0015 | 6 | varies by file |
| Course Level | The data in this file are needed for merging the course-level curriculum data with other Education Files. | 27030-0016 | 1 | 516,146 |
| Curriculum | These math and science curriculum data are derived from coding the textbooks schools reported using for each course offered in these two subjects. | 27030-0017 through 27030-0024 | 8 | varies by file |
| Linking | This file contains variables designed to link transcript data to academic or school years and to Add Health. | 27030-0025 | 1 | 12,237 |
| Primary | The Primary Component contains several types of indicators based on information collected from participating schools and listed directly on student transcripts, such as student exit or graduation status and materials gathered from schools during the data collection process. | 27030-0028 through 27030-0030 | 5 | varies by file |
| Transition | This file contains variables explaining the respondents’ movement through the educational system. | 27030-0031 | 1 | 12,217 |
| Weights | Files contain weights for the education data along with the school weights needed for HLM analysis. | 27030-0032 & 27030-0033 | 2 | varies by file |

 

 

GENETIC FILES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Wave III DNA Data | Twin and full siblings interviewed at Wave III were asked to provide saliva samples for DNA analysis. This file contains the genotype values for DAT1 (dopamine transporter), DRD4 (dopamine receptor), SLC6A4 (serotonin transporter), MAOA_V (monoamine oxidase A-uVNTR), DRD2 (dopamine D2 receptor), and CYP2A6 (cytochrome P450 2A6) from these samples. Also included are values for the following SNPs: rs2304297, rs892413, rs4950, and rs13280604. | 27031-0001 | 24 | 2,574 |
| Wave IV DNA Data | Contains genotyping results for all Wave IV respondents who agreed to provide a saliva sample for DNA testing. This dataset has values for DAT1 (dopamine transporter), DRD4 (dopamine receptor), MAOA (monoamine oxidase A-uVNTR), 5HTTLPR (serotonin transporter), HTTLPR La-Lg-S, triallelic activity bins for the serotonin transporter 5HTTLPR adjusted for rs25531, DRD2, s000005, s000006, DRD5, and MAOCA1. | 27031-0002 | 30 | 15,701 |

 

 

CONSTRUCTED VARIABLES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| School Network Data | Network variables constructed from the in-school questionnaire data and friendship nominations. | 27033-0002 | 440 | 75,871 |
| Wave IV Constructed Variables | This file contains constructed variables on stress, depression, mastery, personality, arrest history, sexual behavior, smoking, and substance abuse created by Wave IV collaborators. | 27033-0003 | 49 | 15,701 |

 

 

 

DISPOSITION FILES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Wave I and II Disposition File | This file contains the types of data available for the Wave I respondents along with the outcome of the 16,706 respondents selected for Wave II. | 27034-0003 | 9 | 20,745 |
| Wave III Disposition File | This file contains the final outcome of the 20,058 cases fielded at Wave III. | 27034-0001 | 3 | 20,058 |
| Wave IV Disposition File | This file contains the final outcome of the 19,962 cases fielded at Wave IV. | 27034-0002 | 2 | 19,962 |
| National Death Index File | This file contains the underlying cause of death and days alive after Wave I interview. | 27034-0004 | 3 | 227 |

 

 

WAVE I & III OBESITY AND NEIGHBORHOOD ENVIRONMENT (ONE) FILES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Wave I and III Climate Files | This file contains the climate data for each Wave III respondent based on the nearest climate station. Information is available on precipitation, total snowfall, sky cover, temperature, and total hours of sunshine. | 27881-0001 & 27881-0012 | WI: 25; WIII: 25 | WI: 20,745; WIII: 15,197 |
| Wave I and III Street Connectivity Files | These files contain road network connectivity measures within 1, 3, 5, and 8.05 km (5 miles) of the Wave I and III respondent locations. | 27881-0002 & 27881-0013 | WI: 57; WIII: 57 | WI: 20,745; WIII: 15,197 |
| Wave I and III Crime Files | The county-level crime data in these files are based on the Wave I and III respondent locations. | 27881-0003 & 27881-0014 | WI: 8; WIII: 8 | WI: 20,745; WIII: 15,197 |
| Wave I and III Geocode Source | The data sources of the Wave I and III respondent residential geocodes (latitude and longitude) are provided in these files. | 27881-0004 & 27881-0015 | WI: 2; WIII: 2 | WI: 20,745; WIII: 15,197 |
| Wave I and III Land Cover Data | These files contain land cover metrics within 1, 3, 5, and 8.05 km (5 miles) of Wave I and III respondent locations. | 27881-0005 & 27881-0016 | WI: 237; WIII: 237 | WI: 20,745; WIII: 15,197 |
| Wave I and III Parks Data | The counts of public parks within a Euclidean distance of 1, 3, 5, and 8.05 kilometers (5 miles) of each respondent at Wave I and III are in these files. | 27881-0006 & 27881-0019 | WI: 43; WIII: 43 | WI: 20,745; WIII: 15,197 |
| Wave I and III Resources Data | These Add Health files provide data on the presence of various physical activity (PA) resources situated near respondent residences at Wave I and III. | 27881-0008 & 27881-0021 | WI: 749; WIII: 3,741 | WI: 20,745; WIII: 15,197 |
| Wave I and III Urban Distances | Contains Euclidean distances to both 1990 and 2000 U.S. Census Urbanized Areas (UAs) for each Wave I respondent. Contains the Euclidean distance to 2000 U.S. Census Bureau-defined urbanized areas (UAs) for each Wave III respondent. | 27881-0010 & 27881-0022 | WI: 3; WIII: 2 | WI: 20,745; WIII: 15,197 |
| Wave I and III Weather Data | This file contains weather data for each Wave III respondent based on the nearest weather station reporting data for the corresponding survey month and year. | 27881-0011 & 27881-0023 | WI: 8; WIII: 8 | WI: 20,745; WIII: 15,197 |
| Wave I and III Population Density Data | The Wave I population density file contains the proportion of 1990 U.S. Census block group population and area (in square meters) within 1, 3, 5, and 8.04672 km (5 mi) of each Wave I respondent. The Wave III population density file contains the proportion of 2000 U.S. Census block group population and area (in square meters) within 1, 3, 5, and 8.04672 km (5 mi) of each Wave III respondent. | 27881-0007 & 27881-0020 | WI: 9; WIII: 9 | WI: 20,745; WIII: 15,197 |
| Wave I School Distance Measures | This file contains the distance between the geocoded point locations of each respondent’s Wave I location and that respondent’s school. | 27881-0009 | WI: 2 | WI: 20,745 |
| Wave III Mobility Data | Reports the distance between each respondent’s geocoded point location for each survey wave and that respondent’s school location, along with the respondent’s move distance between each survey wave. | 27881-0017 | 7 | 20,745 |
| Wave III MSA Pseudo Codes | The MSA pseudo code created for each respondent’s Wave III location is in this file. | 27881-0018 | 2 | 15,197 |
| Wave I and III ACCRA Data | These data files report ACCRA Cost of Living Index data for Wave I and Wave III based on respondent location and the year and quarter of the Add Health interview. | 27881-0031 & 27881-0024 | WI: 38; WIII: 79 | WI: 20,745; WIII: 15,197 |
| Wave I and III Employment Files | Certain county-level employment data from the U.S. Bureau of Labor Statistics are attached to Wave I and Wave III respondent locations. | 27881-0032 & 27881-0025 | WI: 47; WIII: 36 | WI: 20,745; WIII: 15,197 |
| Wave I and III Length of Day | These data files contain the number of hours of daylight at each Wave I and Wave III respondent location based on that respondent’s latitude and survey date. | 27881-0033 & 27881-0026 | WI: 2; WIII: 2 | WI: 20,745; WIII: 15,197 |
| Wave I and III Road Type Length Data | Road type length calculations within radii of 1, 3, 5, and 8.05 kilometers (5 miles) of Wave I and Wave III respondent locations. | 27881-0034 & 27881-0027 | WI: 53; WIII: 53 | WI: 20,745; WIII: 15,197 |
| Wave I and III Rural-Urban Commuting Area (RUCA) Codes | These data files define the rural-urban commuting characteristics of Wave I and Wave III respondent locations at the U.S. Census tract level using the 1990 and 2000 RUCA codes developed by the U.S. Department of Agriculture’s Economic Research Service. | 27881-0035 & 27881-0028 | WI: 3; WIII: 2 | WI: 20,745; WIII: 15,197 |

 

 

ALCOHOL DENSITY FILE

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Wave III Alcohol Outlet Density Data | This Add Health data file measures the prevalence of alcohol outlets in respondent communities by reporting the tract-level density of establishments possessing on- and/or off-premise alcohol licenses. | 28841-0001 | 10 | 15,197 |

 

 

 

POLITICAL CONTEXT FILES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Wave I, II, III Political Context Data | The Add Health Political Context files provide an array of measures that describe the political environments in which Add Health respondents reside. These contextual variables include measures of commuting, election results for gubernatorial, presidential, and senatorial races, and voter registration law. | 28843-0001 through 28843-0003 | WI: 17; WII: 13; WIII: 44 | WI: 20,745; WII: 14,738; WIII: 15,197 |

 

 

 

MEDICATION FILE

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Medication File Data | This file provides the therapeutic classification codes for the medications reported at Wave IV. | 29261-0001 | 6 | 10,711 |

 

 

WAVE IV BIOMARKER FILES

 

| DATASET | DESCRIPTION | ICPSR STUDY # | VARIABLES | OBSERVATIONS |
| --- | --- | --- | --- | --- |
| Glucose – HbA1c | This file contains two measures of glucose homeostasis based on the assay of the Wave IV dried blood spots. | 33443-0001 | 10 | 15,701 |
| CRP-EBV | The results of the assays for CRP (C-reactive protein) and EBV (Epstein-Barr virus) are in this data file. | 33443-0002 | 16 | 15,701 |
| Wave IV Consent | This file contains variables indicating the types of consent (archive, no archive, refused, incarcerated) obtained for the Wave IV blood spot and saliva DNA collections. | 33443-0003 | 3 | 15,701 |
| Lipids | This file contains constructed measures designed to facilitate analysis and interpretation of lipids results. | 33443-0004 | 14 | 15,701 |
| Baroreceptor Sensitivity | This file contains constructed measures for baroreflex sensitivity, heart rate recovery, and systolic blood pressure recovery for the Wave IV respondents. |  | 4 | 15,701 |