Aligning two sequences within a specified diagonal band

Abstract

We describe an algorithm for aligning two sequences within a diagonal band that requires only O(NW) computation time and O(N) space, where N is the length of the shorter of the two sequences and W is the width of the band. The basic algorithm can be used to calculate either local or global alignment scores. Local alignments are produced by finding the beginning and end of a best local alignment in the band, and then applying the global alignment algorithm between those points. This algorithm has been incorporated into the FASTA program package, where it has decreased the amount of memory required to calculate local alignments from O(NW) to O(N) and decreased the time required to calculate optimized scores for every sequence in a protein sequence database by 40%. On computers with limited memory, such as the IBM-PC, this improvement both allows longer sequences to be aligned and allows optimization within wider bands, which can include longer gaps.
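
The banded computation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a linear gap penalty and simple match/mismatch scores (the function name, parameter names, and default scores are all hypothetical), and it returns only the global alignment score. The band is given as a range of diagonals `lo <= j - i <= hi`, so each row touches at most W = hi - lo + 1 cells, for O(NW) time. Recovering the alignment itself in linear space would need a divide-and-conquer step that is not shown here.

```python
NEG_INF = float("-inf")


def banded_global_score(a, b, lo, hi, match=1, mismatch=-1, gap=-2):
    """Global alignment score of a and b restricted to diagonals lo..hi.

    A cell (i, j) lies on diagonal d = j - i. Only cells with lo <= d <= hi
    are evaluated. Linear gap costs are an assumption of this sketch.
    """
    m, n = len(a), len(b)
    if not (lo <= 0 <= hi and lo <= n - m <= hi):
        raise ValueError("band must contain both corners of the DP matrix")

    width = hi - lo + 1
    # prev[k] stores the score of cell (i - 1, j) where j = (i - 1) + lo + k.
    prev = [NEG_INF] * width
    for k in range(width):
        j = lo + k
        if 0 <= j <= n:
            prev[k] = j * gap  # row 0 can only be reached through gaps

    for i in range(1, m + 1):
        cur = [NEG_INF] * width
        for k in range(width):
            j = i + lo + k
            if j < 0 or j > n:
                continue  # this band slot falls outside the matrix
            best = NEG_INF
            if j >= 1:
                # Diagonal move: (i - 1, j - 1) is on the same diagonal.
                sub = match if a[i - 1] == b[j - 1] else mismatch
                best = prev[k] + sub
                # Horizontal move: (i, j - 1) is one diagonal lower.
                if k >= 1:
                    best = max(best, cur[k - 1] + gap)
            # Vertical move: (i - 1, j) is one diagonal higher.
            if k + 1 < width:
                best = max(best, prev[k + 1] + gap)
            cur[k] = best
        prev = cur

    # The end cell (m, n) lies on diagonal n - m.
    return prev[(n - m) - lo]
```

For example, `banded_global_score("ACGT", "AGT", -1, 1)` evaluates only the three central diagonals. Because this sketch keeps just two band-width rows, its working memory is proportional to W rather than to the sequence length.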

Keywords

Diagonal, Sequence (biology), Computation, Smith–Waterman algorithm, Algorithm, Computer science, IBM, Multiple sequence alignment, Sequence alignment, Space (punctuation), Mathematics, Physics, Geometry, Biology

Related Publications

The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools

Julie Thompson, Toby J. Gibson, Frédéric Plewniak +2 more

CLUSTAL X is a new windows interface for the widely-used progressive multiple sequence alignment program CLUSTAL W. The new system is easy to use, providing an integrated system...

1997 Nucleic Acids Research 38996 citations

Efficient Kernel Machines Using the Improved Fast Gauss Transform

Changjiang Yang, Ramani Duraiswami, Larry S. Davis

The computation and memory required for kernel machines with N training samples is at least O(N^2). Such a complexity is significant even for moderate size problems and is prohi...

2004 128 citations

The order of sequence alignment can bias the selection of tree topology.

James A. Lake

Sequential pairwise alignment of multiple sequences is a widely used procedure (Kruskal 1983). It is useful and generally successful when sequences within a set differ by relati...

1991 Molecular Biology and Evolution 150 citations

DCSE, an interactive tool for sequence alignment and secondary structure research

Peter De Rijk, Rupert De Wächter

DCSE provides a user-friendly package for the creation and editing of sequence alignments. The program runs on different platforms, including microcomputers and workstations. Ap...

1993 Computer applications in the biosciences 251 citations

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Haoyi Zhou, Shanghang Zhang, Jieqi Peng +4 more

Many real-world applications require the prediction of long sequence time-series, such as electricity consumption planning. Long sequence time-series forecasting (LSTF) demands ...

2021 Proceedings of the AAAI Conference on... 4749 citations

Publication Info

Year: 1992
Type: article
Volume: 8
Issue: 5
Pages: 481-487
Citations: 165
Access: Closed

Cite This

APA Style

Kun‐Mao Chao, William R. Pearson, Webb Miller (1992). Aligning two sequences within a specified diagonal band. Computer applications in the biosciences, 8(5), 481-487. https://doi.org/10.1093/bioinformatics/8.5.481

Identifiers

DOI: 10.1093/bioinformatics/8.5.481