Data Quality Control in Social Surveys Using Genetic Information

Li, Yi; & Guo, Guang. (2014). Data Quality Control in Social Surveys Using Genetic Information. Biodemography and Social Biology, 60(2), 212-28.

This article introduces a novel way of taking advantage of genetic data in social surveys for the purposes of data quality control. Genetic information could detect and repair data issues such as missing data, reporting errors, differences in measures of the same variable, and flawed data. Using data from two surveys, the College Roommate Study (ROOM) and the National Longitudinal Study of Adolescent Health (Add Health), we show that proportion identical by descent score (a measure of genetic relationships) can identify misreported and unreported sibling type and detect misrepresented participants, bio-ancestry score (a measure of ancestral population memberships) can repair and recover missing race and discrepancies among different measures of self-reported race, and sex chromosomal information may help cross-check self-reported sex. This article represents an initial effort to utilize genetic data for the purposes of data quality control. As genetic data become increasingly available, researchers may explore more approaches to improving data quality.



