PIRSF: family classification system at the Protein Information Resource

2003 Nucleic Acids Research 232 citations

Abstract

The Protein Information Resource (PIR) is an integrated public resource of protein informatics. To facilitate the sensible propagation and standardization of protein annotation and the systematic detection of annotation errors, PIR has extended its superfamily concept and developed the SuperFamily (PIRSF) classification system. Based on the evolutionary relationships of whole proteins, this classification system allows annotation of both specific biological and generic biochemical functions. The system adopts a network structure for protein classification from superfamily to subfamily levels. Protein family members are homologous (sharing common ancestry) and homeomorphic (sharing full-length sequence similarity with common domain architecture). The PIRSF database consists of two data sets, preliminary clusters and curated families. The curated families include family name, protein membership, parent-child relationship, domain architecture, and optional description and bibliography. PIRSF is accessible from the website at http://pir.georgetown.edu/pirsf/ for report retrieval and sequence classification. The report presents family annotation, membership statistics, cross-references to other databases, graphical display of domain architecture, and links to multiple sequence alignments and phylogenetic trees for curated families. PIRSF can be utilized to analyze phylogenetic profiles, to reveal functional convergence and divergence, and to identify interesting relationships between homeomorphic families, domains and structural classes.

Keywords

AnnotationUniProtBiologyStructural Classification of Proteins databasePhylogenetic treeProtein familyProtein domainProtein superfamilySequence alignmentSubfamilyDomain (mathematical analysis)Protein sequencingComputational biologyResource (disambiguation)Information retrievalGeneticsComputer sciencePeptide sequenceProtein structureGene

Affiliated Institutions

Related Publications

Publication Info

Year
2003
Type
article
Volume
32
Issue
90001
Pages
112D-114
Citations
232
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

232
OpenAlex

Cite This

Cathy Wu, A. N. NIKOL'SKAYA, Hongzhan Huang et al. (2003). PIRSF: family classification system at the Protein Information Resource. Nucleic Acids Research , 32 (90001) , 112D-114. https://doi.org/10.1093/nar/gkh097

Identifiers

DOI
10.1093/nar/gkh097