The positive false discovery rate: A Bayesian interpretation and the q-value

Research output: Contribution to journalArticlepeer-review

1239 Scopus citations

Abstract

Multiple hypothesis testing is concerned with controlling the rate of false positives when testing several hypotheses simultaneously. One multiple hypothesis testing error measure is the false discovery rate (FDR), which is loosely defined to be the expected proportion of false positives among all significant hypotheses. The FDR is especially appropriate for exploratory analyses in which one is interested in finding several significant results among many tests. In this work, we introduce a modified version of the FDR called the "positive false discovery rate" (pFDR). We discuss the advantages and disadvantages of the pFDR and investigate its statistical properties. When assuming the test statistics follow a mixture distribution, we show that the pFDR can be written as a Bayesian posterior probability and can be connected to classification theory. These properties remain asymptotically true under fairly general conditions, even under certain forms of dependence. Also, a new quantity called the "q-value" is introduced and investigated, which is a natural "Bayesian posterior p-value," or rather the pFDR analogue of the p-value.

Original languageEnglish (US)
Pages (from-to)2013-2035
Number of pages23
JournalAnnals of Statistics
Volume31
Issue number6
DOIs
StatePublished - Dec 1 2003
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Keywords

  • Multiple comparisons
  • Simultaneous inference
  • p-values
  • pFDR
  • pFNR
  • q-values

Fingerprint Dive into the research topics of 'The positive false discovery rate: A Bayesian interpretation and the q-value'. Together they form a unique fingerprint.

Cite this