Abstract
We develop a general framework for performing large-scale significance testing in the presence of arbitrarily strong dependence. We derive a low-dimensional set of random vectors, called a dependence kernel, that fully captures the dependence structure in an observed high-dimensional dataset. This result shows a surprising reversal of the "curse of dimensionality" in the high-dimensional hypothesis testing setting. We show theoretically that conditioning on a dependence kernel is sufficient to render statistical tests independent regardless of the level of dependence in the observed data. This framework for multiple testing dependence has implications in a variety of common multiple testing problems, such as in gene expression studies, brain imaging, and spatial epidemiology.
Original language | English (US) |
---|---|
Pages (from-to) | 18718-18723 |
Number of pages | 6 |
Journal | Proceedings of the National Academy of Sciences of the United States of America |
Volume | 105 |
Issue number | 48 |
DOIs | |
State | Published - Dec 2 2008 |
All Science Journal Classification (ASJC) codes
- General
Keywords
- Empirical null
- False discovery rate
- Latent structure
- Simultaneous inference
- Surrogate variable analysis