Skip to content. | Skip to navigation

Personal tools

Restricted-Use Dataset Descriptions

Restricted-use, contractual data are available in the following linkable datasets. For information about obtaining these data, please read About Restricted-Use Data.

In-Home Interview Files

 

DESCRIPTION

VARIABLES

OBSERVATIONS

Wave I Adolescent In-Home Interview with AHPVT, Parent In-Home Questionnaire, and Adolescent In-School Questionnaire data attached

A merged file containing the Wave I In-Home Interview data, the Parent Questionnaire data (when available), the In-School Questionnaire data (when available), and the Add Health Picture Vocabulary data (when available). These data were collected in 1994–1995.

2,820

20,745

Wave II In-Home Interview

Data collected during the 1996 in-home interview.

2,540

14,738

Wave III In-Home Interview and STD assay results

Respondent data collected during the 2001–2002 in-home interview. Includes separate files for field interviewer characteristics and AHPVT results.

1,876

15,197

Wave IV  In-Home Interview

Respondent-level data collected during the 2008-2009 in-home interview.  Includes a separate file for the field interviewer characteristics.

974

15,701

 

School Files

 

DESCRIPTION

 

VARIABLES

OBSERVATIONS

School Information Data

Additional information about the individual schools.

20

172

Wave I and II School Administrator Questionnaires

Information from the Wave I (self-administered), Wave II (phone-administered) questionnaires answered by administrators at the sampled schools.

WI: 189

WII: 100

WI: 172

WII: 128

In-School Questionnaire

All responses to the In-School Questionnaire administered September 1994 through April 1995.

227

90,118

Wave I School Network Data

Network variables constructed from the in-school questionnaire data and friendship nominations.

440

75,871

Wave I Network Structure

For each school pair, these files contain a valued friendship network and information on sex, grade in school, race, school pair, and total number of nominations made, including those to non-matchable or out-of-school friends. The files are stored as arc/edge lists in the PAJEK.PAJ format. Information on this freely available network software is at http://vlado.fmf.uni-lj.si/pub/networks/pajek/.

6

varies by school pair

 

Friend Files

 

DESCRIPTION

 

VARIABLES

OBSERVATIONS

In-School Friendship Nominations

Identification numbers of the friends that the respondent nominated during the in-school interview.

 

11

 

90,118

Wave I In-Home Friendship Nominations

Identification numbers of the friends that the respondent nominated during the Wave I in-home interview.

12

 

20,745

 

Wave II In-Home Friendship Nominations

Identification numbers of the friends that the respondent nominated during the Wave II in-home interview.

11

 

14,738

 

Wave III Friend ID Numbers

Add Health respondents who were in the 7th or 8th grade at Wave I were asked to identify, from a list of 10 computer-generated names, which ones were current friends or which ones were their friends when they were in school together. Respondents were asked the questions in Section 6 (Friends) about each person selected from the list of 10. This dataset contains the altered identification numbers (AIDs) of the 10 computer-generated names.

11

3,572

 

Sibling Files

 

DESCRIPTION

 

VARIABLES

OBSERVATIONS

Wave I Adolescent Pair Data

Information that links and describes the sibling pairs.

11

3,139

Wave III Sibling ID Numbers

In section 5 (Relationships with Siblings) Add Health respondents are asked questions about their siblings who also participated in the Wave I or II in-home interviews. This dataset contains the AIDs for these siblings.

10

4,368

 

Contextual Files

 

DESCRIPTION

 

VARIABLES

OBSERVATIONS

Wave I Spatial Analysis Data

X, Y coordinates that can be used to calculate distances between friends in a school community.

4

20,301

Wave I Contextual Data

Community contextual variables based on state, county, tract, and block group levels derived from the Wave I addresses.

2,682

 

20,745

 

Wave II Contextual Data

Community contextual variables based on state, county, tract, and block group levels derived from the Wave II addresses.

2,682

 

14,738

 

Wave III Contextual Data

Community contextual variables based on state, county, tract, and block group levels derived from the Wave III addresses.

594

15,197

Wave I Respondent Grouping Files

Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave I addresses).

5

20,745

 

Wave II Respondent Grouping Files

Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave II addresses).

5

14,738

Wave III Respondent Grouping File

Pseudo state, county, tract, and block group variables in FIPS code format that allow grouping of Add Health respondents geographically (based on Wave III addresses).

2

15,197

 

Wave III Supplemental Files

 

DESCRIPTION

VARIABLES

OBSERVATIONS

ASHA Data

To receive the results of their STD assays, Wave III respondents called an Add Health dedicated number at the American Social Health Association. This dataset provides information on who called the results hotline and the date and time of the call.

75

4,279

BEM Scores

The masculinity and femininity raw and standard scores from the 30 item short form BEM Sex-Role Inventory are available in this file.

5

15,197

Cotinine Assays

This file contains the cotinine and 3-hydroxycotinine assay values for 963 Wave III respondents.

5

963

HPV and MGEN Data

 Assay results for human papillomavirus and mycoplasma genitalium are available for a subset of the Wave III respondents who provided a urine sample.

46

5,126

Mentor Codes

For Wave III respondents who reported having a mentor, the open-ended responses to the question “How did {HE/SHE} help you?” have been coded and are available in this file.

17

15,197

Urinalysis

This file contains nitrate, specific gravity, pH level, white blood cells, protein, glucose, ketone, urobilinogen, bilirubin, microalbumin, urine creatinine, and blood values from the Wave III urine specimens.

16

15,197

 

Wave IV Supplemental Files

 

DESCRIPTION

 

VARIABLES

OBSERVATIONS

Wave IV Constructed Variables

This file contains constructed variables on stress, depression, mastery, personality, arrest history, sexual behavior, smoking, and substance abuse created by Wave IV collaborators.

49

15,701

Wave IV Medication Data

This file provides the therapeutic classification codes for the medications reported at Wave IV.

6

10,711

 

Disposition Files

 

DESCRIPTION

 

VARIABLES

OBSERVATIONS

Wave I and II Disposition File

This file contains the types of data available for the Wave I respondents along with the outcome of the 16,706 respondents selected for Wave II.

9

20,745

Wave III Disposition File

This file contains the final outcome of the 20,058 cases fielded at Wave III.

3

20,058

Wave IV Disposition File

This file contains the final outcome of the 19,962 cases fielded at Wave IV.

2

19,962

 

Genetic File

 

DESCRIPTION

 

VARIABLES

OBSERVATIONS

Wave III DNA Data

Twin and full siblings interviewed at Wave III were asked to provide saliva samples for DNA analysis. This file contains the genotype values for DAT1 (dopamine transporter), DRD4 (dopamine receptor), and SLC6A4 (serotonin transporter), MAOA_V (monoamine oxidase A-uVNTR), DRD2 (dopamine D2 receptor), and CYP2A6 (cytochrome P450 2A6) from these samples. Also included are values for the following SNPs: rs2304297, rs892413, rs4950, rs13280604.

24

2,574

 

Wave III Education Files

 

DESCRIPTION

 

FILES

 

OBSERVATIONS

 

Academic Courses

These files contain academic status and/or performance indicators for math, science, foreign language, English, history, social sciences, physical education, and a combined overall category.

6

12,237

Academic Networks

The Network files provide information on social networks based on the respondents’ course-taking patterns.

3

varies by file

Context

School level contextual data are from the Common Core of Data (CCD), Private School Survey (PSS), the1990 and 2000 Census, and the Office of Civil Rights.

6

varies by file

Course Level

The data in this file are needed for merging the course-level curriculum data with other Education Files.

1

varies by file

Curriculum

These math and science curriculum data are derived from coding the textbooks schools reported using for each course offered in these two subjects.

6

varies by file

Linking

This file contains variables designed to link transcript data to academic or school years and to Add Health.

1

12,237

Primary

The Primary Component contains several types of indicators based on information collected from participating schools and listed directly on student transcripts such as student exit or graduation status and materials gathered from schools during the data collection process.

6

varies by file

Transition

This file contains variables explaining the respondents’ movement through the educational system.

1

12,217

Weights

Files containing the variables needed to weight the education data.

2

varies by file

 

Weight Files

 

DESCRIPTION

 

VARIABLES

OBSERVATIONS

School Weights

 The initial weights for the school are in this file.

2

132

School Administrator Weights

Variables needed to correct for design effects and weight the Wave I school administrator data.

3

164

In-School Weights

Variables needed to correct for design effects and weight the Wave I in-school data.

7

90,118

Wave I Grand Sample Weights

Variables needed to correct for design effects and weight the Wave I in-home data.

4

20,745

Wave II Grand Sample Weights

Variables needed to correct for design effects and weight the Wave II in-home data.

4

14,738

Wave III Grand Sample Weights

Variables needed to correct for design effects and weight the Wave III in-home data, including longitudinal and cross-sectional weights.

5

15,197

Wave IV Grand Sample Weights

Variables needed to correct for design effects and weight the Wave IV in-home data, including longitudinal and cross-sectional weights.

5

15,701

Wave III HPV-MGEN Weights

Sample weights for respondents with HPV and MGEN assay results are in this file.

5

14,322

In-School Weight Components

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights.

8

90,118

Wave I Weight Components

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave I.

5

20,745

Wave II Weight Components

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave II.

5

14,738

Wave III Weight Components

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave III.

6

15,197

Wave IV Weight Components

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave IV.

6

15,701

 

Wave I and III Obesity and Neighborhood Environment (ONE) Files

 

DESCRIPTION

 

VARIABLES

OBSERVATIONS

Wave I and III ACCRA Data

These data files report ACCRA Cost of Living Index data for Wave I and Wave III based on respondent location and the year and quarter of the Add Health interview.

WI: 38

WII: 79

WI: 20,745

WII: 15,197

Wave I and III Climate Files

This file contains the climate data for each Wave III respondent based on the nearest climate station. Information is available on precipitation, total snowfall, sky cover, temperature, and total hours of sunshine.

WI: 25

WIII: 25

WI: 20,745

WIII: 15,197

Wave I and III Connectivity Files

These files contain road network connectivity measures within 1, 3, 5, and 8.05 km (5 miles) of the Wave I and III respondent locations. 

WI: 57

WIII: 57

WI: 20,745

WIII: 15,197

Wave I and III Crime Files

The county-level crime data in these files are based on the Wave I and III respondent locations.

 

WI: 8

WIII: 8

 

WI: 20,745

WIII: 15,197

Wave I and III Employment Files

Certain county-level employment data from the U.S. Bureau of Labor Statistics are attached to Wave I and Wave III respondent locations.

WI: 47

WIII: 36

WI: 20,745

WIII: 15,197

Wave I and III Geocode Source

The data source of the Wave I and III respondent residential geocodes (latitude and longitude) are provided in these files.

WI: 2

WIII: 2

WI: 20,745

WIII: 15,197

Wave I  and III Land Cover Data

These files contain land cover metrics within 1, 3, 5, and 8.05 km (5 miles) of Wave I and III respondent locations.

WI: 237

WIII: 237

WI: 20,745

WIII: 15,197

Wave I and III Length of Day

These data files measure the number of hours of daylight at each Wave I and Wave III respondent location based on that respondent’s latitude and survey date.

WI: 2

WIII: 2

WI: 20,745

WIII: 15,197

Wave III Mobility Data

Reports the distance between each respondent’s geocoded point location for each survey wave and that respondent’s school location, along with the respondent’s move distance between each survey wave. 

7

20,745

Wave I and III Parks Data

The counts of public parks within a Euclidean distance of 1, 3, 5, and 8.05 kilometers (5 miles) of each respondent at Wave I and III are in these files.

WI: 43

WIII: 43

WI: 20,745

WIII: 15,197

Wave I and III Population Density Data

This file contains the proportion of 1990 U.S. Census block group population and area (in square meters) within 1, 3, 5, and 8.04672 km (5 mi) of each Wave I respondent.

WI: 9

WIII: 9

WI: 20,745

WIII: 15,197

Wave III MSA Pseudo Codes

The MSA pseudo code created for each respondent’s Wave III location is in this file.

 

2

 

15,197

Wave I and III Resources Data

These Add Health files provide data on the presence of various physical activity (PA) resources situated near respondent residences at Wave I and III.

 

WI: 749

WIII: 3741

 

WI: 20,745

WIII: 15,197

Wave I and III Road Type Length Data

The lengths of the various types of roads, as classified by the Census Feature Classification Codes, that comprise respondent locations at Wave I and Wave III are reported in these files.

WI: 53

WII: 53

WI: 20,745

WIII: 15,197

Wave I and III Rural-Urban Commuting Area (RUCA) Codes

These data files define the rural-urban commuting characteristics of Wave I and Wave III respondent locations at the U.S. Census tract-level using the 1990 and 2000 RUCA codes developed by the U.S. department of Agriculture’s Economic Research Service.

WI: 3

WIII: 2

WI: 20,745

WIII: 15,197

Wave I School Distance Measures

This file contains the distance between the geocoded point locations of each respondent’s Wave I location and that respondent’s school.

WI: 2

WI: 20,745

Wave I and III Urban Distances

Contains Euclidean distances to both 1990 and 2000 U.S. Census Urbanized Areas (UAs) for each Wave I respondent.  W3URBDST contains the Euclidean distance to 2000 U.S. Census Bureau-defined urbanized areas (UAs) for each Wave III respondent. 

 

 

WI: 3

WIII: 2

 

WI: 20,745

WIII: 15,197

Wave I and III Weather Data

This file contains weather data for each Wave III respondent based on the nearest weather station reporting data for the corresponding survey month and year.

WI: 8

WIII: 8

 

WI: 20,745

WIII: 15,197

 

Alcohol Density File

 

DESCRIPTION

 

VARIABLES

OBSERVATIONS

Wave III Alcohol Data

This Add Health data file measures the prevalence of alcohol outlets in respondent communities by reporting the tract-level density of establishments possessing on- and/or off-premise alcohol licenses.

10

15,197

 

Political Context Files

 

DESCRIPTION

 

VARIABLES

OBSERVATIONS

Wave I, II, III Political Data

The Add Health Political Context files provide an array of measures that describe the political environments in which Add Health respondents reside. These contextual variables include measures of commuting, election results for gubernatorial, presidential, and senatorial races, and voter registration law.

WI: 17

WII: 13

WIII: 44

 

WI: 20,745

WII: 14,738

WIII: 15,197