Abstract

We describe a new approach to multiple sequence alignment using genetic algorithms and an associated software package called SAGA. The method involves evolving a population of alignments in a quasi evolutionary manner and gradually improving the fitness of the population as measured by an objective function which measures multiple alignment quality. SAGA uses an automatic scheduling scheme to control the usage of 22 different operators for combining alignments or mutating them between generations. When used to optimise the well known sums of pairs objective function, SAGA performs better than some of the widely used alternative packages. This is seen with respect to the ability to achieve an optimal solution and with regard to the accuracy of alignment by comparison with reference alignments based on sequences of known tertiary structure. The general attraction of the approach is the ability to optimise any objective function that one can invent.

Keywords

BiologyMultiple sequence alignmentSequence (biology)PopulationSequence alignmentSoftwareAlignment-free sequence analysisFunction (biology)AlgorithmFitness functionGenetic algorithmComputer scienceGeneticsMachine learningPeptide sequence

Affiliated Institutions

Related Publications

Publication Info

Year
1996
Type
article
Volume
24
Issue
8
Pages
1515-1524
Citations
505
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

505
OpenAlex

Cite This

Cédric Notredame, Desmond G. Higgins (1996). SAGA: sequence alignment by genetic algorithm. Nucleic Acids Research , 24 (8) , 1515-1524. https://doi.org/10.1093/nar/24.8.1515

Identifiers

DOI
10.1093/nar/24.8.1515