Abstract

The Carbohydrate-Active Enzyme (CAZy) database is a knowledge-based resource specialized in the enzymes that build and breakdown complex carbohydrates and glycoconjugates. As of September 2008, the database describes the present knowledge on 113 glycoside hydrolase, 91 glycosyltransferase, 19 polysaccharide lyase, 15 carbohydrate esterase and 52 carbohydrate-binding module families. These families are created based on experimentally characterized proteins and are populated by sequences from public databases with significant similarity. Protein biochemical information is continuously curated based on the available literature and structural information. Over 6400 proteins have assigned EC numbers and 700 proteins have a PDB structure. The classification (i) reflects the structural features of these enzymes better than their sole substrate specificity, (ii) helps to reveal the evolutionary relationships between these enzymes and (iii) provides a convenient framework to understand mechanistic properties. This resource has been available for over 10 years to the scientific community, contributing to information dissemination and providing a transversal nomenclature to glycobiologists. More recently, this resource has been used to improve the quality of functional predictions of a number genome projects by providing expert annotation. The CAZy resource resides at URL: http://www.cazy.org/.

Keywords

BiologyGlycoside hydrolaseProtein Data Bank (RCSB PDB)GlycoconjugateDatabaseBiochemistryComputational biologyEnzymeEsteraseAnnotationGeneticsComputer science

Affiliated Institutions

Related Publications

Publication Info

Year
2008
Type
article
Volume
37
Issue
Database
Pages
D233-D238
Citations
5788
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

5788
OpenAlex

Cite This

Brandi L. Cantarel, P. M. Coutinho, Corinne Rancurel et al. (2008). The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Research , 37 (Database) , D233-D238. https://doi.org/10.1093/nar/gkn663

Identifiers

DOI
10.1093/nar/gkn663