A novel method for multiple alignment of sequences with repeated and shuffled elements

Benjamin Raphael, Degui Zhi, Haixu Tang, Pavel Pevzner

Research output: Contribution to journalArticle

81 Scopus citations

Abstract

We describe ABA (A-Bruijn alignment), a new method for multiple alignment of biological sequences. The major difference between ABA and existing multiple alignment methods is that ABA represents an alignment as a directed graph, possibly containing cycles. This representation provides more flexibility than does a traditional alignment matrix or the recently introduced partial order alignment (POA) graph by allowing a larger class of evolutionary relationships between the aligned sequences. Our graph representation is particularly well-suited to the alignment of protein sequences with shuffled and/or repeated domain structure, and allows one to construct multiple alignments of proteins containing (1) domains that are not present in all proteins, (2) domains that are present in different orders in different proteins, and (3) domains that are present in multiple copies in some proteins. In addition, ABA is useful in the alignment of genomic sequences that contain duplications and inversions. We provide several examples illustrating the applications of ABA.

Original languageEnglish (US)
Pages (from-to)2336-2346
Number of pages11
JournalGenome Research
Volume14
Issue number11
DOIs
StatePublished - Nov 2004
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Genetics
  • Genetics(clinical)

Fingerprint Dive into the research topics of 'A novel method for multiple alignment of sequences with repeated and shuffled elements'. Together they form a unique fingerprint.

  • Cite this