You are here: Home / Publications / Publications Database / Data quality control in social surveys using genetic information

Data quality control in social surveys using genetic information

Li, Y.; & Guo, G. (2014). Data quality control in social surveys using genetic information. Biodemography and social biology, 60(2), 212-28.

Li, Y.; & Guo, G. (2014). Data quality control in social surveys using genetic information. Biodemography and social biology, 60(2), 212-28.

Octet Stream icon 5266.ris — Octet Stream, 1 kB (1424 bytes)

This article introduces a novel way of taking advantage of genetic data in social surveys for the purposes of data quality control. Genetic information could detect and repair data issues such as missing data, reporting errors, differences in measures of the same variable, and flawed data. Using data from two surveys, the College Roommate Study (ROOM) and the National Longitudinal Study of Adolescent Health (Add Health), we show that proportion identical by descent score (a measure of genetic relationships) can identify "misreported" and unreported sibling type and detect misrepresented participants, bio-ancestry score (a measure of ancestral population memberships) can repair and recover missing race and discrepancies among different measures of self-reported race, and sex chromosomal information may help cross-check self-reported sex. This article represents an initial effort to utilize genetic data for the purposes of data quality control. As genetic data become increasingly available, researchers may explore more approaches to improving data quality.




JOUR



Li, Y.
Guo, G.



2014


Biodemography and social biology

60

2

212-28


2014/10/25




1948-5565

10.1080/19485565.2014.953029



5266