TY - JOUR
T1 - Polyketide and nonribosomal peptide retro-biosynthesis and global gene cluster matching
AU - Dejong, Chris A.
AU - Chen, Gregory M.
AU - Li, Haoxin
AU - Johnston, Chad W.
AU - Edwards, McLean R.
AU - Rees, Philip N.
AU - Skinnider, Michael A.
AU - Webster, Andrew L.H.
AU - Magarvey, Nathan A.
N1 - Publisher Copyright:
© 2017 Nature America, Inc., part of Springer Nature. All rights reserved.
PY - 2016/12/1
Y1 - 2016/12/1
N2 - Polyketides (PKs) and nonribosomal peptides (NRPs) are profoundly important natural products, forming the foundations of many therapeutic regimes. Decades of research have revealed over 11,000 PK and NRP structures, and genome sequencing is uncovering new PK and NRP gene clusters at an unprecedented rate. However, only â 1/410% of PK and NRPs are currently associated with gene clusters, and it is unclear how many of these orphan gene clusters encode previously isolated molecules. Therefore, to efficiently guide the discovery of new molecules, we must first systematically de-orphan emergent gene clusters from genomes. Here we provide to our knowledge the first comprehensive retro-biosynthetic program, generalized retro-biosynthetic assembly prediction engine (GRAPE), for PK and NRP families and introduce a computational pipeline, global alignment for natural products cheminformatics (GARLIC), to uncover how observed biosynthetic gene clusters relate to known molecules, leading to the identification of gene clusters that encode new molecules.
AB - Polyketides (PKs) and nonribosomal peptides (NRPs) are profoundly important natural products, forming the foundations of many therapeutic regimes. Decades of research have revealed over 11,000 PK and NRP structures, and genome sequencing is uncovering new PK and NRP gene clusters at an unprecedented rate. However, only â 1/410% of PK and NRPs are currently associated with gene clusters, and it is unclear how many of these orphan gene clusters encode previously isolated molecules. Therefore, to efficiently guide the discovery of new molecules, we must first systematically de-orphan emergent gene clusters from genomes. Here we provide to our knowledge the first comprehensive retro-biosynthetic program, generalized retro-biosynthetic assembly prediction engine (GRAPE), for PK and NRP families and introduce a computational pipeline, global alignment for natural products cheminformatics (GARLIC), to uncover how observed biosynthetic gene clusters relate to known molecules, leading to the identification of gene clusters that encode new molecules.
UR - http://www.scopus.com/inward/record.url?scp=84990060398&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84990060398&partnerID=8YFLogxK
U2 - 10.1038/nchembio.2188
DO - 10.1038/nchembio.2188
M3 - Article
C2 - 27694801
AN - SCOPUS:84990060398
SN - 1552-4450
VL - 12
SP - 1007
EP - 1014
JO - Nature Chemical Biology
JF - Nature Chemical Biology
IS - 12
ER -