Abstract
Functional similarity based on Gene Ontology (GO) annotation is used in diverse applications such as gene clustering, gene expression data analysis, and protein interaction prediction and evaluation. However, no comprehensive resource of functional similarity values exists, although such a database would facilitate the use of functional similarity measures across these applications. Here, we describe FunSimMat (Functional Similarity Matrix, http://funsimmat.bioinf.mpi-inf.mpg.de/), a large new database that provides several different semantic similarity measures for GO terms. It offers various precomputed functional similarity values for proteins contained in UniProtKB and for protein families in Pfam and SMART. The web interface allows users to efficiently perform both semantic similarity searches with GO terms and functional similarity searches with proteins or protein families. All results can be downloaded as tab-delimited files for use with other tools. An additional XML-RPC interface gives programs and remote services automated online access to FunSimMat.
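To make "semantic similarity measures for GO terms" concrete, the sketch below computes two widely used information-content measures, Resnik and Lin similarity, on a tiny made-up ontology. The abstract does not name the measures FunSimMat provides, so the choice of these two is an assumption; the term names, the parent links, and the annotation probabilities are all hypothetical and are not taken from GO or from the database.

```python
import math

# Hypothetical toy ontology (NOT real GO): each term maps to its direct parents.
PARENTS = {
    "root": set(),
    "binding": {"root"},
    "catalysis": {"root"},
    "dna_binding": {"binding"},
    "rna_binding": {"binding"},
}

# Hypothetical annotation probabilities: the fraction of proteins annotated
# with a term or any of its descendants. The root covers everything.
PROB = {
    "root": 1.0,
    "binding": 0.5,
    "catalysis": 0.5,
    "dna_binding": 0.25,
    "rna_binding": 0.125,
}


def ancestors(term):
    """Return the term itself together with all of its ancestors."""
    seen = {term}
    stack = [term]
    while stack:
        for parent in PARENTS[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def information_content(term):
    """Information content in bits: rarer terms are more informative."""
    return math.log2(1.0 / PROB[term])


def resnik(a, b):
    """Resnik similarity: information content of the most informative common ancestor."""
    common = ancestors(a) & ancestors(b)
    return max(information_content(t) for t in common)


def lin(a, b):
    """Lin similarity: Resnik similarity normalised by the terms' own information content."""
    denom = information_content(a) + information_content(b)
    if denom == 0:
        # Both terms carry no information (only possible for the root here).
        return 1.0
    return 2.0 * resnik(a, b) / denom


print(resnik("dna_binding", "rna_binding"))
print(lin("dna_binding", "rna_binding"))
```

In this toy setup the two binding terms share the ancestor "binding", so they score higher than a pair whose only common ancestor is the root. A database of precomputed values spares users from repeating this kind of calculation over all term pairs.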
Publication Info
- Year: 2007
- Type: article
- Volume: 36
- Issue: Database
- Pages: D434-D439
- Citations: 93
- Access: Closed
Identifiers
- DOI: 10.1093/nar/gkm806