TY - JOUR
T1 - Cross-dimensional inference of dependent high-dimensional data
AU - Desai, Keyur H.
AU - Storey, John D.
N1 - Funding Information:
Keyur H. Desai is Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544 (E-mail: [email protected]). John D. Storey is a professor in the Lewis-Sigler Institute for Integrative Genomics and the Department of Molecular Biology, Princeton University, Princeton, NJ 08544 (E-mail: [email protected]). This research was supported in part by the National Institutes of Health (NIH) grant R01 HG002913.
PY - 2012
Y1 - 2012
N2 - Agrowing number ofmodern scientific problems in areas such as genomics, neurobiology, and spatial epidemiology involve the measurement and analysis of thousands of related features that may be stochastically dependent at arbitrarily strong levels. In this work, we consider the scenario where the features follow a multivariate Normal distribution. We demonstrate that dependence is manifested as random variation shared among features, and that standard methods may yield highly unstable inference due to dependence, even when the dependence is fully parameterized and utilized in the procedure.We propose a "cross-dimensional inference" framework that alleviates the problems due to dependence by modeling and removing the variation shared among features, while also properly regularizing estimation across features.We demonstrate the framework on both simultaneous point estimation and multiple hypothesis testing in scenarios derived from the scientific applications of interest.
AB - Agrowing number ofmodern scientific problems in areas such as genomics, neurobiology, and spatial epidemiology involve the measurement and analysis of thousands of related features that may be stochastically dependent at arbitrarily strong levels. In this work, we consider the scenario where the features follow a multivariate Normal distribution. We demonstrate that dependence is manifested as random variation shared among features, and that standard methods may yield highly unstable inference due to dependence, even when the dependence is fully parameterized and utilized in the procedure.We propose a "cross-dimensional inference" framework that alleviates the problems due to dependence by modeling and removing the variation shared among features, while also properly regularizing estimation across features.We demonstrate the framework on both simultaneous point estimation and multiple hypothesis testing in scenarios derived from the scientific applications of interest.
KW - Dependent data
KW - False discovery rate
KW - High-dimensional biology
KW - Multiple hypothesis testing
KW - Simultaneous inference
UR - http://www.scopus.com/inward/record.url?scp=84862880410&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84862880410&partnerID=8YFLogxK
U2 - 10.1080/01621459.2011.645777
DO - 10.1080/01621459.2011.645777
M3 - Article
C2 - 38505664
AN - SCOPUS:84862880410
SN - 0162-1459
VL - 107
SP - 135
EP - 151
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 497
ER -