TY - JOUR
T1 - Inferring the Mutational History of a Tumor Using Multi-state Perfect Phylogeny Mixtures
AU - El-Kebir, Mohammed
AU - Satas, Gryte
AU - Oesper, Layla
AU - Raphael, Benjamin J.
N1 - Publisher Copyright:
© 2016
PY - 2016/7/27
Y1 - 2016/7/27
N2 - Phylogenetic techniques are increasingly applied to infer the somatic mutational history of a tumor from DNA sequencing data. However, standard phylogenetic tree reconstruction techniques do not account for the fact that bulk sequencing data measures mutations in a population of cells. We formulate and solve the multi-state perfect phylogeny mixture deconvolution problem of reconstructing a phylogenetic tree given mixtures of its leaves, under the multi-state perfect phylogeny, or infinite alleles model. Our somatic phylogeny reconstruction using combinatorial enumeration (SPRUCE) algorithm uses this model to construct phylogenetic trees jointly from single-nucleotide variants (SNVs) and copy-number aberrations (CNAs). We show that SPRUCE addresses complexities in simultaneous analysis of SNVs and CNAs. In particular, there are often many possible phylogenetic trees consistent with the data, but the ambiguity decreases considerably with an increasing number of samples. These findings have implications for tumor sequencing strategies, suggest caution in drawing strong conclusions based on a single tree reconstruction, and explain difficulties faced by applying existing phylogenetic techniques to tumor sequencing data.
AB - Phylogenetic techniques are increasingly applied to infer the somatic mutational history of a tumor from DNA sequencing data. However, standard phylogenetic tree reconstruction techniques do not account for the fact that bulk sequencing data measures mutations in a population of cells. We formulate and solve the multi-state perfect phylogeny mixture deconvolution problem of reconstructing a phylogenetic tree given mixtures of its leaves, under the multi-state perfect phylogeny, or infinite alleles model. Our somatic phylogeny reconstruction using combinatorial enumeration (SPRUCE) algorithm uses this model to construct phylogenetic trees jointly from single-nucleotide variants (SNVs) and copy-number aberrations (CNAs). We show that SPRUCE addresses complexities in simultaneous analysis of SNVs and CNAs. In particular, there are often many possible phylogenetic trees consistent with the data, but the ambiguity decreases considerably with an increasing number of samples. These findings have implications for tumor sequencing strategies, suggest caution in drawing strong conclusions based on a single tree reconstruction, and explain difficulties faced by applying existing phylogenetic techniques to tumor sequencing data.
UR - http://www.scopus.com/inward/record.url?scp=84995503537&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84995503537&partnerID=8YFLogxK
U2 - 10.1016/j.cels.2016.07.004
DO - 10.1016/j.cels.2016.07.004
M3 - Article
C2 - 27467246
AN - SCOPUS:84995503537
SN - 2405-4712
VL - 3
SP - 43
EP - 53
JO - Cell Systems
JF - Cell Systems
IS - 1
ER -