TY - JOUR
T1 - Genome-wide Insights into the Patterns and Determinants of Fine-Scale Population Structure in Humans
AU - Biswas, Shameek
AU - Scheinfeldt, Laura B.
AU - Akey, Joshua M.
N1 - Funding Information:
We acknowledge the members of the Akey lab for helpful discussion. This research was supported in part by National Institutes of Health grant RO1 GM078105 and the Sloan Research Fellowship in Computational Biology to J.M.A. and by a National Human Genome Research Institute Interdisciplinary Training in Genomic Sciences grant (HG00035) to L.B.S.
PY - 2009/5/15
Y1 - 2009/5/15
N2 - Studying genomic patterns of human population structure provides important insights into human evolutionary history and the relationship among populations, and it has significant practical implications for disease-gene mapping. Here we describe a principal component (PC)-based approach to studying intracontinental population structure in humans, identify the underlying markers mediating the observed patterns of fine-scale population structure, and infer the predominating evolutionary forces shaping local population structure. We applied this methodology to a data set of 650K SNPs genotyped in 944 unrelated individuals from 52 populations and demonstrate that, although typical PC analyses focus on the top axes of variation, substantial information about population structure is contained in lower-ranked PCs. We identified 18 significant PCs, some of which distinguish individual populations. In addition to visually representing sample clusters in PC biplots, we estimated the set of all SNPs significantly correlated with each of the most informative axes of variation. These polymorphisms, unlike ancestry-informative markers (AIMs), constitute a much larger set of loci that drive genomic signatures of population structure. The genome-wide distribution of these significantly correlated markers can largely be accounted for by the stochastic effects of genetic drift, although significant clustering does occur in genomic regions that have been previously implicated as targets of recent adaptive evolution.
AB - Studying genomic patterns of human population structure provides important insights into human evolutionary history and the relationship among populations, and it has significant practical implications for disease-gene mapping. Here we describe a principal component (PC)-based approach to studying intracontinental population structure in humans, identify the underlying markers mediating the observed patterns of fine-scale population structure, and infer the predominating evolutionary forces shaping local population structure. We applied this methodology to a data set of 650K SNPs genotyped in 944 unrelated individuals from 52 populations and demonstrate that, although typical PC analyses focus on the top axes of variation, substantial information about population structure is contained in lower-ranked PCs. We identified 18 significant PCs, some of which distinguish individual populations. In addition to visually representing sample clusters in PC biplots, we estimated the set of all SNPs significantly correlated with each of the most informative axes of variation. These polymorphisms, unlike ancestry-informative markers (AIMs), constitute a much larger set of loci that drive genomic signatures of population structure. The genome-wide distribution of these significantly correlated markers can largely be accounted for by the stochastic effects of genetic drift, although significant clustering does occur in genomic regions that have been previously implicated as targets of recent adaptive evolution.
UR - http://www.scopus.com/inward/record.url?scp=65149089174&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=65149089174&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2009.04.015
DO - 10.1016/j.ajhg.2009.04.015
M3 - Article
C2 - 19442770
AN - SCOPUS:65149089174
SN - 0002-9297
VL - 84
SP - 641
EP - 650
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 5
ER -