Abstract

We used the Automatic Domain Decomposition Algorithm (ADDA) to generate a database of protein domain families with complete coverage of all protein sequences. Sequences are split into domains and domains are grouped into protein domain families in a completely automated process. The current database contains domains for more than 1.5 million sequences in more than 40,000 domain families. In particular, there are 3828 novel domain families that do not overlap with the curated domain databases Pfam, SCOP and InterPro. The data are freely available for downloading and querying via a web interface (http://ekhidna.biocenter.helsinki.fi:9801/sqgraph/pairsdb).

Keywords

BiologyDomain (mathematical analysis)DatabaseProtein domainUploadComputational biologyInterface (matter)Protein–protein interactionBioinformaticsComputer scienceGeneticsWorld Wide Web

Affiliated Institutions

Related Publications

Publication Info

Year
2004
Type
article
Volume
33
Issue
Database issue
Pages
D188-D191
Citations
48
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

48
OpenAlex

Cite This

Andreas Heger, Christopher Wilton, Ashwin Sivakumar et al. (2004). ADDA: a domain database with global coverage of the protein universe. Nucleic Acids Research , 33 (Database issue) , D188-D191. https://doi.org/10.1093/nar/gki096

Identifiers

DOI
10.1093/nar/gki096