TY - JOUR
T1 - Massive variation of short tandem repeats with functional consequences across strains of Arabidopsis thaliana
AU - Press, Maximilian O.
AU - McCoy, Rajiv C.
AU - Hall, Ashley N.
AU - Akey, Joshua M.
AU - Queitsch, Christine
N1 - Publisher Copyright:
© 2018 Press et al.
PY - 2018/8
Y1 - 2018/8
N2 - Short tandem repeat (STR) mutations may comprise more than half of the mutations in eukaryotic coding DNA, yet STR variation is rarely examined as a contributor to complex traits. We assessed this contribution across a collection of 96 strains of Arabidopsis thaliana, genotyping 2046 STR loci each, using highly parallel STR sequencing with molecular inversion probes. We found that 95% of examined STRs are polymorphic, with a median of six alleles per STR across these strains. STR expansions (large copy number increases) are found in most strains, several of which have evident functional effects. These include three of six intronic STR expansions we found to be associated with intron retention. Coding STRs were depleted of variation relative to noncoding STRs, and we detected a total of 56 coding STRs (11%) showing low variation consistent with the action of purifying selection. In contrast, some STRs show hypervariable patterns consistent with diversifying selection. Finally, we detected 133 novel STR-phenotype associations under stringent criteria, most of which could not be detected with SNPs alone, and validated some with follow-up experiments. Our results support the conclusion that STRs constitute a large, unascertained reservoir of functionally relevant genomic variation.
AB - Short tandem repeat (STR) mutations may comprise more than half of the mutations in eukaryotic coding DNA, yet STR variation is rarely examined as a contributor to complex traits. We assessed this contribution across a collection of 96 strains of Arabidopsis thaliana, genotyping 2046 STR loci each, using highly parallel STR sequencing with molecular inversion probes. We found that 95% of examined STRs are polymorphic, with a median of six alleles per STR across these strains. STR expansions (large copy number increases) are found in most strains, several of which have evident functional effects. These include three of six intronic STR expansions we found to be associated with intron retention. Coding STRs were depleted of variation relative to noncoding STRs, and we detected a total of 56 coding STRs (11%) showing low variation consistent with the action of purifying selection. In contrast, some STRs show hypervariable patterns consistent with diversifying selection. Finally, we detected 133 novel STR-phenotype associations under stringent criteria, most of which could not be detected with SNPs alone, and validated some with follow-up experiments. Our results support the conclusion that STRs constitute a large, unascertained reservoir of functionally relevant genomic variation.
UR - http://www.scopus.com/inward/record.url?scp=85050865183&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85050865183&partnerID=8YFLogxK
U2 - 10.1101/gr.231753.117
DO - 10.1101/gr.231753.117
M3 - Article
C2 - 29970452
AN - SCOPUS:85050865183
SN - 1088-9051
VL - 28
SP - 1169
EP - 1178
JO - Genome Research
JF - Genome Research
IS - 8
ER -