Abstract

For aligning DNA sequences that differ only by sequencing errors, or by equivalent errors from other sources, a greedy algorithm can be much faster than traditional dynamic programming approaches and yet produce an alignment that is guaranteed to be theoretically optimal. We introduce a new greedy alignment algorithm with particularly good performance and show that it computes the same alignment as does a certain dynamic programming algorithm, while executing over 10 times faster on appropriate data. An implementation of this algorithm is currently used in a program that assembles the UniGene database at the National Center for Biotechnology Information.

Keywords

Greedy algorithmDynamic programmingComputer scienceAlgorithmUniGeneBiologyGenomeGenetics

Affiliated Institutions

Related Publications

Publication Info

Year
2000
Type
article
Volume
7
Issue
1-2
Pages
203-214
Citations
5473
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

5473
OpenAlex

Cite This

Zheng Zhang, Scott Schwartz, Lukas Wagner et al. (2000). A Greedy Algorithm for Aligning DNA Sequences. Journal of Computational Biology , 7 (1-2) , 203-214. https://doi.org/10.1089/10665270050081478

Identifiers

DOI
10.1089/10665270050081478