Touring protein fold space with Dali/FSSP

1998 Nucleic Acids Research 667 citations

Abstract

The FSSP database and its new supplement, the Dali Domain Dictionary, present a continuously updated classification of all known 3D protein structures. The classification is derived using an automatic structure alignment program (Dali) for the all-against-all comparison of structures in the Protein Data Bank. From the resulting enumeration of structural neighbours (which form a surprisingly continuous distribution in fold space) we derive a discrete fold classification in three steps: (i) sequence-related families are covered by a representative set of protein chains; (ii) protein chains are decomposed into structural domains based on the recurrence of structural motifs; (iii) folds are defined as tight clusters of domains in fold space. The fold classification, domain definitions and test sets for sequence-structure alignment (threading) are accessible on the web at www.embl-ebi.ac.uk/dali . The web interface provides a rich network of links between neighbours in fold space, between domains and proteins, and between structures and sequences leading, for example, to a database of explicit multiple alignments of protein families in the twilight zone of sequence similarity. The Dali/FSSP organization of protein structures provides a map of the currently known regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination.

Keywords

Threading (protein sequence)Structural Classification of Proteins databaseProtein Data BankStructural alignmentFold (higher-order function)Protein structure databaseBiologyProtein structure predictionProtein domainProtein structureProtein foldingComputational biologySequence alignmentProtein familyProtein designStructural bioinformaticsSequence (biology)Computer sciencePeptide sequenceGeneticsSequence database

MeSH Terms

Computer Communication NetworksDatabasesFactualInformation Storage and RetrievalProtein ConformationProtein FoldingProteins

Affiliated Institutions

Related Publications

Publication Info

Year
1998
Type
article
Volume
26
Issue
1
Pages
316-319
Citations
667
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

667
OpenAlex
47
Influential
527
CrossRef

Cite This

Liisa Holm (1998). Touring protein fold space with Dali/FSSP. Nucleic Acids Research , 26 (1) , 316-319. https://doi.org/10.1093/nar/26.1.316

Identifiers

DOI
10.1093/nar/26.1.316
PMID
9399863
PMCID
PMC147193

Data Quality

Data completeness: 86%