Abstract

Abstract Motivation: Consensus sequence generation is important in many kinds of sequence analysis ranging from sequence assembly to profile-based iterative search methods. However, how can a consensus be constructed when its inherent assumption—that the aligned sequences form a single linear consensus—is not true? Results: Partial Order Alignment (POA) enables construction and analysis of multiple sequence alignments as directed acyclic graphs containing complex branching structure. Here we present a dynamic programming algorithm (heaviest_bundle) for generating multiple consensus sequences from such complex alignments. The number and relationships of these consensus sequences reveals the degree of structural complexity of the source alignment. This is a powerful and general approach for analyzing and visualizing complex alignment structures, and can be applied to any alignment. We illustrate its value for analyzing expressed sequence alignments to detect alternative splicing, reconstruct full length mRNA isoform sequences from EST fragments, and separate paralog mixtures that can cause incorrect SNP predictions. Availability: The heaviest_bundle source code is available at http://www.bioinformatics.ucla.edu/poa Contact: leec@mbi.ucla.edu * To whom correspondence should be addressed.

Keywords

Sequence (biology)Multiple sequence alignmentConsensus sequenceOrder (exchange)CombinatoricsComputer scienceMathematicsSequence alignmentBiologyGeneticsBase sequencePeptide sequence

Affiliated Institutions

Related Publications

Publication Info

Year
2003
Type
article
Volume
19
Issue
8
Pages
999-1008
Citations
99
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

99
OpenAlex

Cite This

Christopher J. Lee (2003). Generating consensus sequences from partialorder multiple sequence alignment graphs. Bioinformatics , 19 (8) , 999-1008. https://doi.org/10.1093/bioinformatics/btg109

Identifiers

DOI
10.1093/bioinformatics/btg109