Abstract
Siegmund and Yakir (2000) have given an approximate p-value when two independent, identically distributed sequences from a finite alphabet are optimally aligned based on a scoring system that rewards similarities according to a general scoring matrix and penalizes gaps (insertions and deletions). The approximation involves an infinite sequence of difficult-to-compute parameters. In this paper, it is shown by numerical studies that these reduce to essentially two numerically distinct parameters, which can be computed as one-dimensional numerical integrals. For an arbitrary scoring matrix and affine gap penalty, this modified approximation is easily evaluated. Comparison with published numerical results show that it is reasonably accurate.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 549-556 |
| Number of pages | 8 |
| Journal | Journal of Computational Biology |
| Volume | 8 |
| Issue number | 5 |
| DOIs | |
| State | Published - 2001 |
| Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Computational Mathematics
- Genetics
- Molecular Biology
- Computational Theory and Mathematics
- Modeling and Simulation
Keywords
- Affine gap penalty
- Local alignment
- Markov renewal theory
- p-value