Skip to main content

Citation

Li, Yi & Guo, Guang (2014). Data Quality Control in Social Surveys Using Genetic Information. Biodemography and Social Biology, 60(2), 212-228. PMCID: PMC6642059

Abstract

This article introduces a novel way of taking advantage of genetic data in social surveys for the purposes of data quality control. Genetic information could detect and repair data issues such as missing data, reporting errors, differences in measures of the same variable, and flawed data. Using data from two surveys, the College Roommate Study (ROOM) and the National Longitudinal Study of Adolescent Health (Add Health), we show that proportion identical by descent score (a measure of genetic relationships) can identify misreported and unreported sibling type and detect misrepresented participants, bio-ancestry score (a measure of ancestral population memberships) can repair and recover missing race and discrepancies among different measures of self-reported race, and sex chromosomal information may help cross-check self-reported sex. This article represents an initial effort to utilize genetic data for the purposes of data quality control. As genetic data become increasingly available, researchers may explore more approaches to improving data quality.

URL

http://dx.doi.org/10.1080/19485565.2014.953029

Reference Type

Journal Article

Year Published

2014

Journal Title

Biodemography and Social Biology

Author(s)

Li, Yi
Guo, Guang

PMCID

PMC6642059

ORCiD

Guo - 0000-0002-4465-9881