Abstract
Automated methods for assigning peptides to observed tandem mass spectra typically return a list of peptide-spectrum matches, ranked according to an arbitrary score. In this article, we describe methods for converting these arbitrary scores into more useful statistical significance measures. These methods employ a decoy sequence database as a model of the null hypothesis, and use false discovery rate (FDR) analysis to correct for multiple testing. We first describe a simple FDR inference method and then describe how estimating and taking into account the percentage of incorrectly identified spectra in the entire data set can lead to increased statistical power.
Original language | English (US) |
---|---|
Pages (from-to) | 29-34 |
Number of pages | 6 |
Journal | Journal of Proteome Research |
Volume | 7 |
Issue number | 1 |
DOIs | |
State | Published - Jan 2008 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- General Chemistry
- Biochemistry
Keywords
- Decoy database
- False discovery rate
- Peptide identification
- Statistical significance
- q-value