TY - JOUR
T1 - Machine learning methods to model multicellular complexity and tissue specificity
AU - Sealfon, Rachel S.G.
AU - Wong, Aaron K.
AU - Troyanskaya, Olga G.
N1 - Funding Information:
Figures created with Biorender.com. ADAR icon by Emw – Own work, CC BY-SA 3.0, https://commons.wikimedia.org/ w/index.php?curid=8761779. This work is supported by NIH/NIDDK grants U24DK100845, UGDK114907 and U2CDK114886 and NIH grant UH3TR002158 to O.G.T. We thank C. Theesfeld for helpful discussion and comments on the manuscript.
Publisher Copyright:
© 2021, Springer Nature Limited.
PY - 2021/8
Y1 - 2021/8
N2 - Experimental approaches to study tissue specificity enable insight into the nature and organization of the cell types and tissues that constitute complex multicellular organisms. Machine learning provides a powerful tool to investigate and interpret tissue-specific experimental data. In this Review, we first provide a brief introduction to key single-cell and whole-tissue approaches that allow investigation of tissue specificity and then highlight two classes of machine-learning-based methods, which can be applied to analyse, model and interpret these experimental data. Deep learning methods can predict tissue-dependent effects of individual mutations on gene expression, alternative splicing and disease phenotypes. Network-based approaches can capture relationships between biomolecules, integrate large heterogeneous data compendia to model molecular circuits and identify tissue-specific functional relationships and regulatory connections. We conclude with an outlook to future possibilities in examining multicellular complexity by combining high-resolution, large-scale multiomics data sets and interpretable machine learning models.
AB - Experimental approaches to study tissue specificity enable insight into the nature and organization of the cell types and tissues that constitute complex multicellular organisms. Machine learning provides a powerful tool to investigate and interpret tissue-specific experimental data. In this Review, we first provide a brief introduction to key single-cell and whole-tissue approaches that allow investigation of tissue specificity and then highlight two classes of machine-learning-based methods, which can be applied to analyse, model and interpret these experimental data. Deep learning methods can predict tissue-dependent effects of individual mutations on gene expression, alternative splicing and disease phenotypes. Network-based approaches can capture relationships between biomolecules, integrate large heterogeneous data compendia to model molecular circuits and identify tissue-specific functional relationships and regulatory connections. We conclude with an outlook to future possibilities in examining multicellular complexity by combining high-resolution, large-scale multiomics data sets and interpretable machine learning models.
UR - http://www.scopus.com/inward/record.url?scp=85110627377&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85110627377&partnerID=8YFLogxK
U2 - 10.1038/s41578-021-00339-3
DO - 10.1038/s41578-021-00339-3
M3 - Review article
AN - SCOPUS:85110627377
SN - 2058-8437
VL - 6
SP - 717
EP - 729
JO - Nature Reviews Materials
JF - Nature Reviews Materials
IS - 8
ER -