ADDA: a domain database with global coverage of the protein universe

Andreas Heger; Christopher Wilton; Ashwin Sivakumar; Liisa Holm

doi:10.1093/nar/gki096

Abstract

We used the Automatic Domain Decomposition Algorithm (ADDA) to generate a database of protein domain families with complete coverage of all protein sequences. Sequences are split into domains and domains are grouped into protein domain families in a completely automated process. The current database contains domains for more than 1.5 million sequences in more than 40,000 domain families. In particular, there are 3828 novel domain families that do not overlap with the curated domain databases Pfam, SCOP and InterPro. The data are freely available for downloading and querying via a web interface (http://ekhidna.biocenter.helsinki.fi:9801/sqgraph/pairsdb).
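The notion of a "novel" family in the abstract, one whose domains do not overlap any curated annotation, can be illustrated with a small sketch. This is not the ADDA implementation: the `Domain` record, the `overlaps` and `novel_families` helpers, the identifiers and the coordinates are all hypothetical, and the overlap criterion used here (any shared residue on the same sequence) is an assumption rather than the threshold used by the authors.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Domain:
    """A domain assignment on one sequence (hypothetical record layout)."""
    sequence_id: str  # illustrative sequence identifier
    start: int        # first residue, inclusive
    end: int          # last residue, inclusive
    family: str       # illustrative family label


def overlaps(a: Domain, b: Domain) -> bool:
    """True if both domains lie on the same sequence and share a residue."""
    return a.sequence_id == b.sequence_id and a.start <= b.end and b.start <= a.end


def novel_families(automatic: list[Domain], curated: list[Domain]) -> set[str]:
    """Families for which no member domain overlaps any curated domain."""
    all_families = {d.family for d in automatic}
    covered = {
        d.family
        for d in automatic
        if any(overlaps(d, c) for c in curated)
    }
    return all_families - covered


# Toy data: invented coordinates, not taken from the database.
automatic = [
    Domain("seqA", 1, 120, "fam1"),
    Domain("seqA", 121, 300, "fam2"),
    Domain("seqB", 1, 90, "fam2"),
    Domain("seqC", 1, 200, "fam3"),
]
curated = [Domain("seqA", 10, 100, "curated_x")]

novel = novel_families(automatic, curated)
print(sorted(novel))
```

In this toy input only `fam1` touches a curated region, so `fam2` and `fam3` are reported as novel. A real comparison would likely need a minimum-overlap threshold and an interval index rather than the quadratic scan used here.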

Keywords

Biology; Domain (mathematical analysis); Database; Protein domain; Upload; Computational biology; Interface (matter); Protein–protein interaction; Bioinformatics; Computer science; Genetics; World Wide Web

Publication Info

Year: 2004
Type: article
Volume: 33
Issue: Database issue
Pages: D188-D191
Citations: 48
Access: Closed

Cite This

APA Style

Andreas Heger, Christopher Wilton, Ashwin Sivakumar et al. (2004). ADDA: a domain database with global coverage of the protein universe. Nucleic Acids Research, 33(Database issue), D188-D191. https://doi.org/10.1093/nar/gki096
