Abstract

We describe an algorithm for aligning two sequences within a diagonal band that requires only O(NW) computation time and O(N) space, where N is the length of the shorter of the two sequences and W is the width of the band. The basic algorithm can be used to calculate either local or global alignment scores. Local alignments are produced by finding the beginning and end of a best local alignment in the band, and then applying the global alignment algorithm between those points. This algorithm has been incorporated into the FASTA program package, where it has decreased the amount of memory required to calculate local alignments from O(NW) to O(N) and decreased the time required to calculate optimized scores for every sequence in a protein sequence database by 40%. On computers with limited memory, such as the IBM-PC, this improvement both allows longer sequences to be aligned and allows optimization within wider bands, which can include longer gaps.

Keywords

DiagonalSequence (biology)ComputationSmith–Waterman algorithmAlgorithmComputer scienceIBMMultiple sequence alignmentSequence alignmentSpace (punctuation)MathematicsPhysicsGeometryBiology

Affiliated Institutions

Related Publications

Publication Info

Year
1992
Type
article
Volume
8
Issue
5
Pages
481-487
Citations
165
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

165
OpenAlex

Cite This

Kun‐Mao Chao, William R. Pearson, Webb Miller (1992). Aligning two sequences within a specified diagonal band. Computer applications in the biosciences , 8 (5) , 481-487. https://doi.org/10.1093/bioinformatics/8.5.481

Identifiers

DOI
10.1093/bioinformatics/8.5.481