Abstract
Variation in gene expression is thought to make a significant contribution to phenotypic diversity among individuals within populations. Although high-throughput cDNA sequencing offers a unique opportunity to delineate the genomewide architecture of regulatory variation, new statistical methods need to be developed to capitalize on the wealth of information contained in RNA-seq data sets. To this end, we developed a powerful and flexible hierarchical Bayesian model that combines information across loci to allow both global and locus-specific inferences about allele-specific expression (ASE). We applied our methodology to a large RNA-seq data set obtained in a diploid hybrid of two diverse Saccharomyces cerevisiae strains, as well as to RNA-seq data from an individual human genome. Our statistical framework accurately quantifies levels of ASE with specified false-discovery rates, achieving high reproducibility between independent sequencing platforms. We pinpoint loci that show unusual and biologically interesting patterns of ASE, including allele-specific alternative splicing and transcription termination sites. Our methodology provides a rigorous, quantitative, and high-resolution tool for profiling ASE across whole genomes.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 1728-1737 |
| Number of pages | 10 |
| Journal | Genome Research |
| Volume | 21 |
| Issue number | 10 |
| DOIs | |
| State | Published - Oct 2011 |
| Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Genetics
- Genetics(clinical)
Fingerprint
Dive into the research topics of 'A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver