TY - JOUR
T1 - Using the phenotype differences model to identify genetic effects in samples of partially genotyped sibling pairs
AU - Trejo, Sam
AU - Kanopka, Klint
N1 - Publisher Copyright:
© 2024 the Author(s).
PY - 2024/12/3
Y1 - 2024/12/3
N2 - The identification of causal relationships between specific genes and social, behavioral, and health outcomes is challenging due to environmental confounding from population stratification and dynastic genetic effects. Existing methods to eliminate environmental confounding leverage random genetic variation resulting from recombination and require within-family dyadic genetic data (i.e., parent-child and/or sibling pairs), meaning they can only be applied in relatively small and selected samples. We introduce the phenotype differences model and provide derivations showing that it-under plausible assumptions-provides consistent (and, in certain cases, unbiased) estimates of genetic effects using just a single individual's genotype. Then, leveraging distinct samples of fully and partially genotyped sibling pairs in the Wisconsin Longitudinal Study, we use polygenic indices and phenotypic data for 24 different traits to empirically validate the phenotype differences model. Finally, we utilize the model to test the effects of 40 polygenic indices on lifespan. After a 10% false discovery rate correction, we find that polygenic indices for three traits-body mass index, self-rated health, chronic obstructive pulmonary disease-have a statistically significant effect on an individual's lifespan.
AB - The identification of causal relationships between specific genes and social, behavioral, and health outcomes is challenging due to environmental confounding from population stratification and dynastic genetic effects. Existing methods to eliminate environmental confounding leverage random genetic variation resulting from recombination and require within-family dyadic genetic data (i.e., parent-child and/or sibling pairs), meaning they can only be applied in relatively small and selected samples. We introduce the phenotype differences model and provide derivations showing that it-under plausible assumptions-provides consistent (and, in certain cases, unbiased) estimates of genetic effects using just a single individual's genotype. Then, leveraging distinct samples of fully and partially genotyped sibling pairs in the Wisconsin Longitudinal Study, we use polygenic indices and phenotypic data for 24 different traits to empirically validate the phenotype differences model. Finally, we utilize the model to test the effects of 40 polygenic indices on lifespan. After a 10% false discovery rate correction, we find that polygenic indices for three traits-body mass index, self-rated health, chronic obstructive pulmonary disease-have a statistically significant effect on an individual's lifespan.
KW - biodemography
KW - causal inference
KW - genomics
KW - premature mortality
KW - quantitative methods
UR - http://www.scopus.com/inward/record.url?scp=85210962878&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85210962878&partnerID=8YFLogxK
U2 - 10.1073/pnas.2405725121
DO - 10.1073/pnas.2405725121
M3 - Article
C2 - 39589875
AN - SCOPUS:85210962878
SN - 0027-8424
VL - 121
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 49
M1 - e2405725121
ER -