TY - GEN
T1 - A parsimony approach to analysis of human segmental duplications
AU - Kahn, Crystal L.
AU - Raphael, Benjamin J.
PY - 2009
Y1 - 2009
N2 - Segmental duplications are abundant in the human genome, but their evolutionary history is not well-understood. The mystery surrounding them is due in part to their complex organization; many segmental duplications are mosaic patterns of smaller repeated segments, or duplicons. A two-step model of duplication has been proposed to explain these mosaic patterns. In this model, duplicons are copied and aggregated into primary duplication blocks that subsequently seed secondary duplications. Here, we formalize the problem of computing a duplication scenario that is consistent with the two-step model. We first describe a dynamic programming algorithm to compute the duplication distance between two strings. We then use this distance as the cost function in an integer linear program to obtain the most parsimonious duplication scenario. We apply our method to derive putative ancestral relationships between segmental duplications in the human genome.
AB - Segmental duplications are abundant in the human genome, but their evolutionary history is not well-understood. The mystery surrounding them is due in part to their complex organization; many segmental duplications are mosaic patterns of smaller repeated segments, or duplicons. A two-step model of duplication has been proposed to explain these mosaic patterns. In this model, duplicons are copied and aggregated into primary duplication blocks that subsequently seed secondary duplications. Here, we formalize the problem of computing a duplication scenario that is consistent with the two-step model. We first describe a dynamic programming algorithm to compute the duplication distance between two strings. We then use this distance as the cost function in an integer linear program to obtain the most parsimonious duplication scenario. We apply our method to derive putative ancestral relationships between segmental duplications in the human genome.
UR - http://www.scopus.com/inward/record.url?scp=61949248723&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=61949248723&partnerID=8YFLogxK
M3 - Conference contribution
C2 - 19213134
AN - SCOPUS:61949248723
SN - 9812836926
SN - 9789812836922
T3 - Pacific Symposium on Biocomputing 2009, PSB 2009
SP - 126
EP - 137
BT - Pacific Symposium on Biocomputing 2009, PSB 2009
T2 - 14th Pacific Symposium on Biocomputing, PSB 2009
Y2 - 5 January 2009 through 9 January 2009
ER -