Abstract

Abstract Several choices of amino acid substitution matrices are currently available for searching and alignment applications. These choices were evaluated using the BLAST searching program, which is extremely sensitive to differences among matrices, and the Prosite catalog, which lists members of hundreds of protein families. Matrices derived directly from either sequence‐based or structurebased alignments of distantly related proteins performed much better overall than extrapolated matrices based on the Dayhoff evolutionary model. Similar results were obtained with the FASTA searching program. Improved performance appears to be general rather than family‐specific, reflecting improved accuracy in scoring alignments. An implementation of a multiple matrix strategy was also tested. While no combination of three matrices performed as well as the single best matrix, BLOSUM 62, good results were obtained using a combination of sequence‐based and structure‐based matrices. This hybrid set of matrices is likely to be useful in certain situations. Our results illustrate the importance of matrix selection and value of a comprehensive approach to evaluation of protein comparison tools. © 1993 Wiley‐Liss, Inc.

Keywords

Substitution (logic)Set (abstract data type)Sequence (biology)Computer scienceMatrix (chemical analysis)Selection (genetic algorithm)Sequence alignmentComputational biologyAmino acid substitutionData miningAlgorithmPeptide sequenceArtificial intelligenceBiologyGeneticsChemistryGeneProgramming languageMutation

Affiliated Institutions

Related Publications

Publication Info

Year
1993
Type
article
Volume
17
Issue
1
Pages
49-61
Citations
441
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

441
OpenAlex

Cite This

Steven Henikoff, Jorja G. Henikoff (1993). Performance evaluation of amino acid substitution matrices. Proteins Structure Function and Bioinformatics , 17 (1) , 49-61. https://doi.org/10.1002/prot.340170108

Identifiers

DOI
10.1002/prot.340170108