Protein Data Bank: the single global archive for 3D macromolecular structure data

2018 Nucleic Acids Research 1,112 citations

Abstract

The Protein Data Bank (PDB) is the single global archive of experimentally determined three-dimensional (3D) structure data of biological macromolecules. Since 2003, the PDB has been managed by the Worldwide Protein Data Bank (wwPDB; wwpdb.org), an international consortium that collaboratively oversees deposition, validation, biocuration, and open access dissemination of 3D macromolecular structure data. The PDB Core Archive houses 3D atomic coordinates of more than 144 000 structural models of proteins, DNA/RNA, and their complexes with metals and small molecules and related experimental data and metadata. Structure and experimental data/metadata are also stored in the PDB Core Archive using the readily extensible wwPDB PDBx/mmCIF master data format, which will continue to evolve as data/metadata from new experimental techniques and structure determination methods are incorporated by the wwPDB. Impacts of the recently developed universal wwPDB OneDep deposition/validation/biocuration system and various methods-specific wwPDB Validation Task Forces on improving the quality of structures and data housed in the PDB Core Archive are described together with current challenges and future plans.

Keywords

Protein Data BankProtein Data Bank (RCSB PDB)MetadataData bankBiologyComputer scienceData qualityInformation retrievalBioinformaticsComputational biologyWorld Wide WebProtein structureBiochemistryEngineering

Affiliated Institutions

Related Publications

RCSB Protein Data Bank: powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences

Abstract The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), the US data center for the global PDB archive and a founding member of the Worldw...

2020 Nucleic Acids Research 1465 citations

Publication Info

Year
2018
Type
article
Volume
47
Issue
D1
Pages
D520-D528
Citations
1112
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1112
OpenAlex

Cite This

Lora Mak, Saqib Mir, Abhik Mukhopadhyay et al. (2018). Protein Data Bank: the single global archive for 3D macromolecular structure data. Nucleic Acids Research , 47 (D1) , D520-D528. https://doi.org/10.1093/nar/gky949

Identifiers

DOI
10.1093/nar/gky949