Abstract

Profile analysis is a method for detecting distantly related proteins by sequence comparison. The basis for comparison is not only the customary Dayhoff mutational-distance matrix but also the results of structural studies and information implicit in the alignments of the sequences of families of similar proteins. This information is expressed in a position-specific scoring table (profile), which is created from a group of sequences previously aligned by structural or sequence similarity. The similarity of any other sequence (target) to the group of aligned sequences (probe) can be tested by comparing the target to the profile using dynamic programming algorithms. The profile method differs in two major respects from methods of sequence comparison in common use: (i) Any number of known sequences can be used to construct the profile, allowing more information to be used in the testing of the target than is possible with pairwise alignment methods. (ii) The profile includes the penalties for insertion or deletion at each position, which allow one to include the probe secondary structure in the testing scheme. Tests with globin and immunoglobulin sequences show that profile analysis can distinguish all members of these families from all other sequences in a database containing 3800 protein sequences.

Keywords

Sequence (biology)Computational biologyPairwise comparisonMultiple sequence alignmentStructural alignmentSequence alignmentSequence analysisSimilarity (geometry)GeneticsBiologyDynamic programmingComputer scienceBioinformaticsPattern recognition (psychology)Peptide sequenceAlgorithmArtificial intelligenceGene

Affiliated Institutions

Related Publications

Profile hidden Markov models.

Abstract The recent literature on profile hidden Markov model (profile HMM) methods and software is reviewed. Profile HMMs turn a multiple sequence alignment into a position-spe...

1998 Bioinformatics 5657 citations

Publication Info

Year
1987
Type
article
Volume
84
Issue
13
Pages
4355-4358
Citations
1321
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1321
OpenAlex

Cite This

Michael Gribskov, A. McLachlan, David Eisenberg (1987). Profile analysis: detection of distantly related proteins.. Proceedings of the National Academy of Sciences , 84 (13) , 4355-4358. https://doi.org/10.1073/pnas.84.13.4355

Identifiers

DOI
10.1073/pnas.84.13.4355