TY - JOUR
T1 - Peak Annotation and Verification Engine for Untargeted LC-MS Metabolomics
AU - Wang, Lin
AU - Xing, Xi
AU - Chen, Li
AU - Yang, Lifeng
AU - Su, Xiaoyang
AU - Rabitz, Herschel
AU - Lu, Wenyun
AU - Rabinowitz, Joshua D.
N1 - Publisher Copyright:
© 2018 American Chemical Society.
PY - 2019/2/5
Y1 - 2019/2/5
N2 - Untargeted metabolomics can detect more than 10※000 peaks in a single LC-MS run. The correspondence between these peaks and metabolites, however, remains unclear. Here, we introduce a Peak Annotation and Verification Engine (PAVE) for annotating untargeted microbial metabolomics data. The workflow involves growing cells in 13 C and 15 N isotope-labeled media to identify peaks from biological compounds and their carbon and nitrogen atom counts. Improved deisotoping and deadducting are enabled by algorithms that integrate positive mode, negative mode, and labeling data. To distinguish metabolites and their fragments, PAVE experimentally measures the response of each peak to weak in-source collision induced dissociation, which increases the peak intensity for fragments while decreasing it for their parent ions. The molecular formulas of the putative metabolites are then assigned based on database searching using both m/z and C/N atom counts. Application of this procedure to Saccharomyces cerevisiae and Escherichia coli revealed that more than 80% of peaks do not label, i.e., are environmental contaminants. More than 70% of the biological peaks are isotopic variants, adducts, fragments, or mass spectrometry artifacts yielding â2000 apparent metabolites across the two organisms. About 650 match to a known metabolite formula based on m/z and C/N atom counts, with 220 assigned structures based on MS/MS and/or retention time to match to authenticated standards. Thus, PAVE enables systematic annotation of LC-MS metabolomics data with only â4% of peaks annotated as apparent metabolites.
AB - Untargeted metabolomics can detect more than 10※000 peaks in a single LC-MS run. The correspondence between these peaks and metabolites, however, remains unclear. Here, we introduce a Peak Annotation and Verification Engine (PAVE) for annotating untargeted microbial metabolomics data. The workflow involves growing cells in 13 C and 15 N isotope-labeled media to identify peaks from biological compounds and their carbon and nitrogen atom counts. Improved deisotoping and deadducting are enabled by algorithms that integrate positive mode, negative mode, and labeling data. To distinguish metabolites and their fragments, PAVE experimentally measures the response of each peak to weak in-source collision induced dissociation, which increases the peak intensity for fragments while decreasing it for their parent ions. The molecular formulas of the putative metabolites are then assigned based on database searching using both m/z and C/N atom counts. Application of this procedure to Saccharomyces cerevisiae and Escherichia coli revealed that more than 80% of peaks do not label, i.e., are environmental contaminants. More than 70% of the biological peaks are isotopic variants, adducts, fragments, or mass spectrometry artifacts yielding â2000 apparent metabolites across the two organisms. About 650 match to a known metabolite formula based on m/z and C/N atom counts, with 220 assigned structures based on MS/MS and/or retention time to match to authenticated standards. Thus, PAVE enables systematic annotation of LC-MS metabolomics data with only â4% of peaks annotated as apparent metabolites.
UR - http://www.scopus.com/inward/record.url?scp=85060022721&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85060022721&partnerID=8YFLogxK
U2 - 10.1021/acs.analchem.8b03132
DO - 10.1021/acs.analchem.8b03132
M3 - Article
C2 - 30586294
AN - SCOPUS:85060022721
SN - 0003-2700
VL - 91
SP - 1838
EP - 1846
JO - Analytical Chemistry
JF - Analytical Chemistry
IS - 3
ER -