Restricted-Use Dataset Descriptions
Restricted-use, contractual data are available in the following linkable datasets. For information about obtaining these data, please read About Restricted-Use Data.
In-Home Interview Files
|
DESCRIPTION |
VARIABLES |
OBSERVATIONS |
|
Wave I Adolescent In-Home Interview with AHPVT, Parent In-Home Questionnaire, and Adolescent In-School Questionnaire data attached A merged file containing the Wave I In-Home Interview data, the Parent Questionnaire data (when available), the In-School Questionnaire data (when available), and the Add Health Picture Vocabulary data (when available). These data were collected in 1994–1995. |
2,820 |
20,745 |
|
Wave II In-Home Interview Data collected during the 1996 in-home interview. |
2,540 |
14,738 |
|
Wave III In-Home Interview and STD assay results Respondent data collected during the 2001–2002 in-home interview. Includes separate files for field interviewer characteristics and AHPVT results. |
1,876 |
15,197 |
|
Wave IV In-Home Interview Respondent-level data collected during the 2008–2009 in-home interview. Includes a separate file for the field interviewer characteristics. |
974 |
15,701 |
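The datasets on this page are described as linkable, and the file descriptions refer to respondent identifiers as AIDs. A minimal sketch of attaching one file's records to another by respondent identifier is shown here. It assumes each file has already been read into a list of dictionaries and that the identifier field is named `AID`; the field names and values are invented for illustration and are not taken from the actual files.

```python
def left_join(primary, secondary, key="AID"):
    """Attach fields from `secondary` to each record in `primary` that shares `key`."""
    lookup = {row[key]: row for row in secondary}
    merged = []
    for row in primary:
        extra = lookup.get(row[key], {})
        # Values from the primary file win if a field name appears in both files.
        merged.append({**extra, **row})
    return merged


# Hypothetical records; real files and variable names will differ.
wave1 = [{"AID": "A001", "w1_item": 3}, {"AID": "A002", "w1_item": 5}]
wave2 = [{"AID": "A001", "w2_item": 4}]

linked = left_join(wave1, wave2)
```

A left join is used because the observation counts differ across waves (20,745 at Wave I versus 14,738 at Wave II), so some Wave I respondents have no later record and are kept without the extra fields.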
School Files
|
DESCRIPTION
|
VARIABLES |
OBSERVATIONS |
|
School Information Data Additional information about the individual schools. |
20 |
172 |
|
Wave I and II School Administrator Questionnaires Information from the Wave I (self-administered) and Wave II (phone-administered) questionnaires answered by administrators at the sampled schools. |
WI: 189 WII: 100 |
WI: 172 WII: 128 |
|
In-School Questionnaire All responses to the In-School Questionnaire administered September 1994 through April 1995. |
227 |
90,118 |
|
Wave I School Network Data Network variables constructed from the in-school questionnaire data and friendship nominations. |
440 |
75,871 |
|
Wave I Network Structure For each school pair, these files contain a valued friendship network and information on sex, grade in school, race, school pair, and total number of nominations made, including those to non-matchable or out-of-school friends. The files are stored as arc/edge lists in the Pajek (.paj) format. Information on this freely available network software is at http://vlado.fmf.uni-lj.si/pub/networks/pajek/. |
6 |
varies by school pair |
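The Wave I Network Structure files are described as arc/edge lists for Pajek. The sketch below reads a simplified Pajek-style text with a `*Vertices` section and an `*Arcs` section of `source target weight` lines. It is an illustration under that assumption only: the actual project files may contain additional sections (for example, the attribute information mentioned in the description) that this reader ignores, and the sample network is invented.

```python
import shlex


def read_arcs(text):
    """Return (labels, arcs) from Pajek-style text with *Vertices and *Arcs sections."""
    labels, arcs, section = {}, [], None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith("*"):
            # Section headers look like "*Vertices 3" or "*Arcs".
            section = line.split()[0].lower()
            continue
        parts = shlex.split(line)  # keeps quoted vertex labels together
        if section == "*vertices":
            labels[int(parts[0])] = parts[1] if len(parts) > 1 else parts[0]
        elif section == "*arcs":
            # A missing weight is treated as 1.0 (an assumption of this sketch).
            weight = float(parts[2]) if len(parts) > 2 else 1.0
            arcs.append((int(parts[0]), int(parts[1]), weight))
    return labels, arcs


# Invented three-person network, not Add Health data.
sample = """*Vertices 3
1 "a"
2 "b"
3 "c"
*Arcs
1 2 2
2 3 1
3 1 1
"""

labels, arcs = read_arcs(sample)

# Count nominations sent by each vertex.
out_degree = {}
for src, _dst, _weight in arcs:
    out_degree[src] = out_degree.get(src, 0) + 1
```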
Friend Files
|
DESCRIPTION
|
VARIABLES |
OBSERVATIONS |
|
In-School Friendship Nominations Identification numbers of the friends that the respondent nominated during the in-school interview. |
11 |
90,118 |
|
Wave I In-Home Friendship Nominations Identification numbers of the friends that the respondent nominated during the Wave I in-home interview. |
12 |
20,745
|
|
Wave II In-Home Friendship Nominations Identification numbers of the friends that the respondent nominated during the Wave II in-home interview. |
11 |
14,738
|
|
Wave III Friend ID Numbers Add Health respondents who were in the 7th or 8th grade at Wave I were asked to identify, from a list of 10 computer-generated names, which ones were current friends or which ones were their friends when they were in school together. Respondents were asked the questions in Section 6 (Friends) about each person selected from the list of 10. This dataset contains the altered identification numbers (AIDs) of the 10 computer-generated names. |
11 |
3,572 |
Sibling Files
|
DESCRIPTION
|
VARIABLES |
OBSERVATIONS |
|
Wave I Adolescent Pair Data Information that links and describes the sibling pairs. |
11 |
3,139 |
|
Wave III Sibling ID Numbers In Section 5 (Relationships with Siblings), Add Health respondents were asked questions about their siblings who also participated in the Wave I or II in-home interviews. This dataset contains the AIDs for these siblings. |
10 |
4,368 |
Contextual Files
|
DESCRIPTION
|
VARIABLES |
OBSERVATIONS |
|
Wave I Spatial Analysis Data X, Y coordinates that can be used to calculate distances between friends in a school community. |
4 |
20,301 |
|
Wave I Contextual Data Community contextual variables based on state, county, tract, and block group levels derived from the Wave I addresses. |
2,682
|
20,745
|
|
Wave II Contextual Data Community contextual variables based on state, county, tract, and block group levels derived from the Wave II addresses. |
2,682
|
14,738
|
|
Wave III Contextual Data Community contextual variables based on state, county, tract, and block group levels derived from the Wave III addresses. |
594 |
15,197 |
|
Wave I Respondent Grouping Files Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave I addresses). |
5 |
20,745
|
|
Wave II Respondent Grouping Files Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave II addresses). |
5 |
14,738 |
|
Wave III Respondent Grouping File Pseudo state, county, tract, and block group variables in FIPS code format that allow grouping of Add Health respondents geographically (based on Wave III addresses). |
2 |
15,197 |
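The Wave I Spatial Analysis Data are described as X, Y coordinates for calculating distances between friends. A minimal sketch of that calculation follows, assuming planar coordinates in a common unit. The identifiers, coordinates, and friend pairs are hypothetical, and the unit of the result is whatever unit the coordinates use.

```python
import math


def euclidean(p, q):
    """Straight-line distance between two (x, y) points."""
    return math.hypot(q[0] - p[0], q[1] - p[1])


# Hypothetical coordinates keyed by respondent identifier.
coords = {"A001": (0.0, 0.0), "A002": (3.0, 4.0), "A003": (6.0, 8.0)}

# Hypothetical friendship nominations (nominator, nominee).
friend_pairs = [("A001", "A002"), ("A001", "A003")]

distances = {
    pair: euclidean(coords[pair[0]], coords[pair[1]]) for pair in friend_pairs
}
```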
Wave III Supplemental Files
|
DESCRIPTION |
VARIABLES |
OBSERVATIONS |
|
ASHA Data To receive the results of their STD assays, Wave III respondents called an Add Health dedicated number at the American Social Health Association. This dataset provides information on who called the results hotline and the date and time of the call. |
75 |
4,279 |
|
BEM Scores The masculinity and femininity raw and standard scores from the 30-item short form of the BEM Sex-Role Inventory are available in this file. |
5 |
15,197 |
|
Cotinine Assays This file contains the cotinine and 3-hydroxycotinine assay values for 963 Wave III respondents. |
5 |
963 |
|
HPV and MGEN Data Assay results for human papillomavirus and Mycoplasma genitalium are available for a subset of the Wave III respondents who provided a urine sample. |
46 |
5,126 |
|
Mentor Codes For Wave III respondents who reported having a mentor, the open-ended responses to the question “How did {HE/SHE} help you?” have been coded and are available in this file. |
17 |
15,197 |
|
Urinalysis This file contains nitrate, specific gravity, pH level, white blood cells, protein, glucose, ketone, urobilinogen, bilirubin, microalbumin, urine creatinine, and blood values from the Wave III urine specimens. |
16 |
15,197 |
Wave IV Supplemental Files
|
DESCRIPTION
|
VARIABLES |
OBSERVATIONS |
|
Wave IV Constructed Variables This file contains constructed variables on stress, depression, mastery, personality, arrest history, sexual behavior, smoking, and substance abuse created by Wave IV collaborators. |
49 |
15,701 |
|
Wave IV Medication Data This file provides the therapeutic classification codes for the medications reported at Wave IV. |
6 |
10,711 |
Disposition Files
|
DESCRIPTION
|
VARIABLES |
OBSERVATIONS |
|
Wave I and II Disposition File This file contains the types of data available for the Wave I respondents along with the outcome of the 16,706 respondents selected for Wave II. |
9 |
20,745 |
|
Wave III Disposition File This file contains the final outcome of the 20,058 cases fielded at Wave III. |
3 |
20,058 |
|
Wave IV Disposition File This file contains the final outcome of the 19,962 cases fielded at Wave IV. |
2 |
19,962 |
Genetic File
|
DESCRIPTION
|
VARIABLES |
OBSERVATIONS |
|
Wave III DNA Data Twin and full siblings interviewed at Wave III were asked to provide saliva samples for DNA analysis. This file contains the genotype values for DAT1 (dopamine transporter), DRD4 (dopamine receptor), SLC6A4 (serotonin transporter), MAOA_V (monoamine oxidase A-uVNTR), DRD2 (dopamine D2 receptor), and CYP2A6 (cytochrome P450 2A6) from these samples. Also included are values for the following SNPs: rs2304297, rs892413, rs4950, rs13280604. |
24 |
2,574 |
Wave III Education Files
|
DESCRIPTION
|
FILES |
OBSERVATIONS
|
|
Academic Courses These files contain academic status and/or performance indicators for math, science, foreign language, English, history, social sciences, physical education, and a combined overall category. |
6 |
12,237 |
|
Academic Networks The Network files provide information on social networks based on the respondents’ course-taking patterns. |
3 |
varies by file |
|
Context School-level contextual data are from the Common Core of Data (CCD), the Private School Survey (PSS), the 1990 and 2000 Census, and the Office of Civil Rights. |
6 |
varies by file |
|
Course Level The data in this file are needed for merging the course-level curriculum data with other Education Files. |
1 |
varies by file |
|
Curriculum These math and science curriculum data are derived from coding the textbooks schools reported using for each course offered in these two subjects. |
6 |
varies by file |
|
Linking This file contains variables designed to link transcript data to academic or school years and to Add Health. |
1 |
12,237 |
|
Primary The Primary Component contains several types of indicators based on information collected from participating schools and listed directly on student transcripts such as student exit or graduation status and materials gathered from schools during the data collection process. |
6 |
varies by file |
|
Transition This file contains variables explaining the respondents’ movement through the educational system. |
1 |
12,217 |
|
Weights Files containing the variables needed to weight the education data. |
2 |
varies by file |
Weight Files
|
DESCRIPTION
|
VARIABLES |
OBSERVATIONS |
|
School Weights The initial weights for the school are in this file. |
2 |
132 |
|
School Administrator Weights Variables needed to correct for design effects and weight the Wave I school administrator data. |
3 |
164 |
|
In-School Weights Variables needed to correct for design effects and weight the Wave I in-school data. |
7 |
90,118 |
|
Wave I Grand Sample Weights Variables needed to correct for design effects and weight the Wave I in-home data. |
4 |
20,745 |
|
Wave II Grand Sample Weights Variables needed to correct for design effects and weight the Wave II in-home data. |
4 |
14,738 |
|
Wave III Grand Sample Weights Variables needed to correct for design effects and weight the Wave III in-home data, including longitudinal and cross-sectional weights. |
5 |
15,197 |
|
Wave IV Grand Sample Weights Variables needed to correct for design effects and weight the Wave IV in-home data, including longitudinal and cross-sectional weights. |
5 |
15,701 |
|
Wave III HPV-MGEN Weights Sample weights for respondents with HPV and MGEN assay results are in this file. |
5 |
14,322 |
|
In-School Weight Components A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights. |
8 |
90,118 |
|
Wave I Weight Components A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave I. |
5 |
20,745 |
|
Wave II Weight Components A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave II. |
5 |
14,738 |
|
Wave III Weight Components A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave III. |
6 |
15,197 |
|
Wave IV Weight Components A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave IV. |
6 |
15,701 |
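The weight files above supply the variables needed to weight each wave's data. As a minimal sketch of what applying a sample weight means for a point estimate, the function below computes a weighted mean as the sum of weight times value divided by the sum of weights. The values and weights are invented. The sketch does not address design effects (clustering and stratification), which affect variance estimates and are normally handled with survey analysis software.

```python
def weighted_mean(values, weights):
    """Weighted mean over records where both the value and the weight are present."""
    pairs = [
        (v, w) for v, w in zip(values, weights) if v is not None and w is not None
    ]
    total_weight = sum(w for _, w in pairs)
    if total_weight == 0:
        raise ValueError("no usable weights")
    return sum(v * w for v, w in pairs) / total_weight


# Hypothetical 0/1 responses (None = missing) and hypothetical sample weights.
responses = [1, 0, 1, None]
weights = [100.0, 300.0, 100.0, 50.0]

estimate = weighted_mean(responses, weights)  # (100 + 0 + 100) / 500
```

The unweighted mean of the three observed responses would be about 0.67; the weighted estimate is lower because the respondent answering 0 carries the largest weight.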
Wave I and III Obesity and Neighborhood Environment (ONE) Files
|
DESCRIPTION
|
VARIABLES |
OBSERVATIONS |
|
Wave I and III ACCRA Data These data files report ACCRA Cost of Living Index data for Wave I and Wave III based on respondent location and the year and quarter of the Add Health interview. |
WI: 38 WIII: 79 |
WI: 20,745 WIII: 15,197 |
|
Wave I and III Climate Files This file contains the climate data for each Wave III respondent based on the nearest climate station. Information is available on precipitation, total snowfall, sky cover, temperature, and total hours of sunshine. |
WI: 25 WIII: 25 |
WI: 20,745 WIII: 15,197 |
|
Wave I and III Connectivity Files These files contain road network connectivity measures within 1, 3, 5, and 8.05 km (5 miles) of the Wave I and III respondent locations. |
WI: 57 WIII: 57 |
WI: 20,745 WIII: 15,197 |
|
Wave I and III Crime Files The county-level crime data in these files are based on the Wave I and III respondent locations. |
WI: 8 WIII: 8 |
WI: 20,745 WIII: 15,197 |
|
Wave I and III Employment Files Certain county-level employment data from the U.S. Bureau of Labor Statistics are attached to Wave I and Wave III respondent locations. |
WI: 47 WIII: 36 |
WI: 20,745 WIII: 15,197 |
|
Wave I and III Geocode Source The data sources of the Wave I and III respondent residential geocodes (latitude and longitude) are provided in these files. |
WI: 2 WIII: 2 |
WI: 20,745 WIII: 15,197 |
|
Wave I and III Land Cover Data These files contain land cover metrics within 1, 3, 5, and 8.05 km (5 miles) of Wave I and III respondent locations. |
WI: 237 WIII: 237 |
WI: 20,745 WIII: 15,197 |
|
Wave I and III Length of Day These data files measure the number of hours of daylight at each Wave I and Wave III respondent location based on that respondent’s latitude and survey date. |
WI: 2 WIII: 2 |
WI: 20,745 WIII: 15,197 |
|
Wave III Mobility Data Reports the distance between each respondent’s geocoded point location for each survey wave and that respondent’s school location, along with the respondent’s move distance between each survey wave. |
7 |
20,745 |
|
Wave I and III Parks Data The counts of public parks within a Euclidean distance of 1, 3, 5, and 8.05 kilometers (5 miles) of each respondent at Wave I and III are in these files. |
WI: 43 WIII: 43 |
WI: 20,745 WIII: 15,197 |
|
Wave I and III Population Density Data This file contains the proportion of 1990 U.S. Census block group population and area (in square meters) within 1, 3, 5, and 8.04672 km (5 mi) of each Wave I respondent. |
WI: 9 WIII: 9 |
WI: 20,745 WIII: 15,197 |
|
Wave III MSA Pseudo Codes The MSA pseudo code created for each respondent’s Wave III location is in this file. |
2 |
15,197 |
|
Wave I and III Resources Data These Add Health files provide data on the presence of various physical activity (PA) resources situated near respondent residences at Wave I and III. |
WI: 749 WIII: 3,741 |
WI: 20,745 WIII: 15,197 |
|
Wave I and III Road Type Length Data The lengths of the various types of roads, as classified by the Census Feature Classification Codes, that comprise respondent locations at Wave I and Wave III are reported in these files. |
WI: 53 WIII: 53 |
WI: 20,745 WIII: 15,197 |
|
Wave I and III Rural-Urban Commuting Area (RUCA) Codes These data files define the rural-urban commuting characteristics of Wave I and Wave III respondent locations at the U.S. Census tract level using the 1990 and 2000 RUCA codes developed by the U.S. Department of Agriculture’s Economic Research Service. |
WI: 3 WIII: 2 |
WI: 20,745 WIII: 15,197 |
|
Wave I School Distance Measures This file contains the distance between the geocoded point locations of each respondent’s Wave I location and that respondent’s school. |
WI: 2 |
WI: 20,745 |
|
Wave I and III Urban Distances Contains Euclidean distances to both 1990 and 2000 U.S. Census Urbanized Areas (UAs) for each Wave I respondent. W3URBDST contains the Euclidean distance to 2000 U.S. Census Bureau-defined urbanized areas (UAs) for each Wave III respondent. |
WI: 3 WIII: 2
|
WI: 20,745 WIII: 15,197 |
|
Wave I and III Weather Data This file contains weather data for each Wave III respondent based on the nearest weather station reporting data for the corresponding survey month and year. |
WI: 8 WIII: 8
|
WI: 20,745 WIII: 15,197 |
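The Length of Day files are described as hours of daylight derived from each respondent's latitude and survey date. The sketch below shows one common textbook approximation for that kind of calculation. It is not claimed to be the method used to build the files; it ignores atmospheric refraction and elevation, and the latitude and dates are hypothetical.

```python
import math
from datetime import date


def day_length_hours(latitude_deg, when):
    """Approximate hours between sunrise and sunset at a latitude on a given date."""
    n = when.timetuple().tm_yday
    # Approximate solar declination in degrees for day-of-year n.
    declination = -23.44 * math.cos(math.radians(360.0 / 365.0 * (n + 10)))
    # Cosine of the sunrise/sunset hour angle, clamped for polar day and night.
    x = -math.tan(math.radians(latitude_deg)) * math.tan(math.radians(declination))
    x = max(-1.0, min(1.0, x))
    hour_angle_deg = math.degrees(math.acos(x))
    # The sun covers 15 degrees of hour angle per hour.
    return 2.0 * hour_angle_deg / 15.0


# Hypothetical location at 36 degrees north on two hypothetical survey dates.
summer = day_length_hours(36.0, date(1995, 6, 21))
winter = day_length_hours(36.0, date(1995, 12, 21))
equator = day_length_hours(0.0, date(1995, 6, 21))
```

Under this approximation the summer value is longer than the winter value at northern latitudes, and the equator stays at 12 hours throughout the year.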
Alcohol Density File
|
DESCRIPTION
|
VARIABLES |
OBSERVATIONS |
|
Wave III Alcohol Data This Add Health data file measures the prevalence of alcohol outlets in respondent communities by reporting the tract-level density of establishments possessing on- and/or off-premise alcohol licenses. |
10 |
15,197 |
Political Context Files
|
DESCRIPTION
|
VARIABLES |
OBSERVATIONS |
|
Wave I, II, III Political Data The Add Health Political Context files provide an array of measures that describe the political environments in which Add Health respondents reside. These contextual variables include measures of commuting, election results for gubernatorial, presidential, and senatorial races, and voter registration law. |
WI: 17 WII: 13 WIII: 44
|
WI: 20,745 WII: 14,738 WIII: 15,197
|

