TY - JOUR
T1 - Variant-resolved prediction of context-specific isoform variation with a graph-based attention model
AU - Litman, Aviya
AU - Pan, Zhicheng
AU - Sokolova, Ksenia
AU - Fang, Joyce
AU - Marvin, Tess
AU - Sauerwald, Natalie
AU - Park, Christopher Y.
AU - Theesfeld, Chandra L.
AU - Troyanskaya, Olga G.
N1 - Publisher Copyright:
© 2025 The Author(s).
PY - 2026
Y1 - 2026
N2 - In eukaryotes, most genes produce multiple transcript isoforms that diversify the transcriptome and proteome, serving as a key mechanism of functional regulation. Genetic variation can disrupt the RNA processing signals that shape isoform structure and abundance, yet modeling these effects at full-length isoform resolution remains challenging due to the complexity of transcript regulation. Here, we introduce Otari, an attention-based graph neural network framework trained on the human genomic sequence and long-read transcriptomes across 30 tissue types and brain regions. Otari predicts tissue-specific differential isoform abundance by integrating sequence-derived epigenetic and post-transcriptional signals, enabling isoform-resolved variant effect interpretation. Applied to large-scale variant datasets, including an autism cohort, Otari uncovers patterns of isoform dysregulation undetectable at the gene level, such as variant-driven perturbations in isoform abundance and microexon usage implicated in autism pathophysiology. We provide Otari as a resource for powering isoform-level analyses across tissues at scale.
AB - In eukaryotes, most genes produce multiple transcript isoforms that diversify the transcriptome and proteome, serving as a key mechanism of functional regulation. Genetic variation can disrupt the RNA processing signals that shape isoform structure and abundance, yet modeling these effects at full-length isoform resolution remains challenging due to the complexity of transcript regulation. Here, we introduce Otari, an attention-based graph neural network framework trained on the human genomic sequence and long-read transcriptomes across 30 tissue types and brain regions. Otari predicts tissue-specific differential isoform abundance by integrating sequence-derived epigenetic and post-transcriptional signals, enabling isoform-resolved variant effect interpretation. Applied to large-scale variant datasets, including an autism cohort, Otari uncovers patterns of isoform dysregulation undetectable at the gene level, such as variant-driven perturbations in isoform abundance and microexon usage implicated in autism pathophysiology. We provide Otari as a resource for powering isoform-level analyses across tissues at scale.
KW - alternative splicing
KW - attention
KW - autism
KW - graph neural networks
KW - isoforms
KW - long-read RNA-seq
KW - post-transcriptional regulation
KW - transcriptomics
KW - variant effect prediction
UR - https://www.scopus.com/pages/publications/105027665251
UR - https://www.scopus.com/pages/publications/105027665251#tab=citedBy
U2 - 10.1016/j.xgen.2025.101126
DO - 10.1016/j.xgen.2025.101126
M3 - Article
AN - SCOPUS:105027665251
SN - 2666-979X
JO - Cell Genomics
JF - Cell Genomics
M1 - 101126
ER -