TY - JOUR
T1 - Context-sensitive data integration and prediction of biological networks
AU - Myers, Chad L.
AU - Troyanskaya, Olga G.
N1 - Funding Information:
The authors would like to thank Matt Hibbs, Curtis Huttenhower, Florian Markowetz, and Edo Airoldi for insightful discussions. This research is partially supported by NSF CAREER award DBI-0546275 to OGT, NIH grant R01 GM071966, NIH grant T32 HG003284, and NIGMS Center of Excellence grant P50 GM071508. OGT is an Alfred P. Sloan Research Fellow.
PY - 2007/9/1
Y1 - 2007/9/1
N2 - Motivation: Several recent methods have addressed the problem of heterogeneous data integration and network prediction by modeling the noise inherent in high-throughput genomic datasets, which can dramatically improve specificity and sensitivity and allow the robust integration of datasets with heterogeneous properties. However, experimental technologies capture different biological processes with varying degrees of success, and thus, each source of genomic data can vary in relevance depending on the biological process one is interested in predicting. Accounting for this variation can significantly improve network prediction, but to our knowledge, no previous approaches have explicitly leveraged this critical information about biological context. Results: We confirm the presence of context-dependent variation in functional genomic data and propose a Bayesian approach for context-sensitive integration and query-based recovery of biological process-specific networks. By applying this method to Saccharomyces cerevisiae, we demonstrate that leveraging contextual information can significantly improve the precision of network predictions, including assignment for uncharacterized genes. We expect that this general context-sensitive approach can be applied to other organisms and prediction scenarios.
AB - Motivation: Several recent methods have addressed the problem of heterogeneous data integration and network prediction by modeling the noise inherent in high-throughput genomic datasets, which can dramatically improve specificity and sensitivity and allow the robust integration of datasets with heterogeneous properties. However, experimental technologies capture different biological processes with varying degrees of success, and thus, each source of genomic data can vary in relevance depending on the biological process one is interested in predicting. Accounting for this variation can significantly improve network prediction, but to our knowledge, no previous approaches have explicitly leveraged this critical information about biological context. Results: We confirm the presence of context-dependent variation in functional genomic data and propose a Bayesian approach for context-sensitive integration and query-based recovery of biological process-specific networks. By applying this method to Saccharomyces cerevisiae, we demonstrate that leveraging contextual information can significantly improve the precision of network predictions, including assignment for uncharacterized genes. We expect that this general context-sensitive approach can be applied to other organisms and prediction scenarios.
UR - http://www.scopus.com/inward/record.url?scp=34548749776&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34548749776&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btm332
DO - 10.1093/bioinformatics/btm332
M3 - Article
C2 - 17599939
AN - SCOPUS:34548749776
SN - 1367-4803
VL - 23
SP - 2322
EP - 2330
JO - Bioinformatics
JF - Bioinformatics
IS - 17
ER -