Using positional distribution to identify splicing elements and predict pre-mRNA processing defects in human genes

Kian Huat Lim, Luciana Ferraris, Madeleine E. Filloux, Benjamin J. Raphael, William G. Fairbrother

Research output: Contribution to journalArticlepeer-review

224 Scopus citations

Abstract

We present an intuitive strategy for predicting the effect of sequence variation on splicing. In contrast to transcriptional elements, splicing elements appear to be strongly position dependent. We demonstrated that exonic binding of the normally intronic splicing factor, U2AF65, inhibits splicing. Reasoning that the positional distribution of a splicing element is a signature of its function, we developed a method for organizing all possible sequence motifs into clusters based on the genomic profile of their positional distribution around splice sites. Binding sites for serine/arginine rich (SR) proteins tended to be exonic whereas heterogeneous ribonucleoprotein (hnRNP) recognition elements were mostly intronic. In addition to the known elements, novel motifs were returned and validated. This method was also predictive of splicing mutations. A mutation in a motif creates a new motif that sometimes has a similar distribution shape to the original motif and sometimes has a different distribution. We created an intraallelic distance measure to capture this property and found that mutations that created large intraallelic distances disrupted splicing in vivo whereas mutations with small distances did not alter splicing. Analyzing the dataset of human disease alleles revealed known splicing mutants to have high intraallelic distances and suggested that 22% of disease alleles that were originally classified as missense mutations may also affect splicing. This category together with mutations in the canonical splicing signals suggest that approximately one third of all disease-causing mutations alter pre-mRNA splicing.

Original languageEnglish (US)
Pages (from-to)11093-11098
Number of pages6
JournalProceedings of the National Academy of Sciences of the United States of America
Volume108
Issue number27
DOIs
StatePublished - Jul 5 2011
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • General

Fingerprint

Dive into the research topics of 'Using positional distribution to identify splicing elements and predict pre-mRNA processing defects in human genes'. Together they form a unique fingerprint.

Cite this