UniProt: the universal protein knowledgebase in 2021

Alex Bateman , María Martin , Sandra Orchard , Alex Bateman , María Martin , Sandra Orchard , Michele Magrane , Rahat Agivetova , Shadab Ahmad , Emanuele Alpi , Emily Bowler-Barnett , Ramona Britto , Borisas Bursteinas , Hema Bye‐A‐Jee , Ray Coetzee , Austra Cukura , Alan Da Silva , Paul Denny , Tunca Doğan , ThankGod E. Ebenezer , Jun Fan , Leyla Jael Castro , Penelope Garmiri , George P. Georghiou , Leonardo Jose da Costa Gonzales , Emma Hatton-Ellis , Abdulrahman Hussein , Alexandr Ignatchenko , Giuseppe Insana , Rizwan Ishtiaq , Petteri Jokinen , Vishal Joshi , Dushyanth Jyothi , Antonia Lock , Rodrigo López , Aurélien Luciani , Jie Luo , Yvonne Lussi , Alistair MacDougall , Fábio Madeira , Mahdi Mahmoudy , M. Menchi , Alok Mishra , Katie Moulang , Andrew Nightingale , Carla Susana Oliveira , Sangya Pundir , Guoying Qi , Shriya Raj , Daniel L Rice , M. Rodríguez-López , Rabie Saidi , J. H. Sampson , Tony Sawford , Elena Speretta , E. B. Turner , Nidhi Tyagi , Preethi Vasudev , Vladimir Volynkin , Kate Warner , Xavier Watkins , Rossana Zaru , Hermann Zellner , Alan Bridge , Sylvain Poux , Nicole Redaschi , Lucila Aimo , Ghislaine Argoud‐Puy , Andrea H Auchincloss , Kristian B. Axelsen , Parit Bansal , Delphine Baratin , Marie-Claude Blatter , Jerven Bolleman , Emmanuel Boutet , Lionel Breuza , Cristina Casals‐Casas , Leyla Jael Castro , Kamal Chikh Echioukh , Elisabeth Coudert , Beatrice Cuche , Mikael Doche , Dolnide Dornevil , Anne Estreicher , Maria Livia Famiglietti , Marc Feuermann , Elisabeth Gasteiger , Sébastien Géhant , Vivienne Baillie Gerritsen , Arnaud Gos , Nadine Gruaz-Gumowski , Ursula Hinz , Chantal Hulo , Nevila Hyka‐Nouspikel , Florence Jungo , G. Keller , Arnaud Kerhornou , V. Lara , Philippe Le Mercier , Damien Lieberherr , Thierry Lombardot , Xavier Martín , Patrick Masson
2020 Nucleic Acids Research 6,740 citations

Abstract

Abstract The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this article, we describe significant updates that we have made over the last two years to the resource. The number of sequences in UniProtKB has risen to approximately 190 million, despite continued work to reduce sequence redundancy at the proteome level. We have adopted new methods of assessing proteome completeness and quality. We continue to extract detailed annotations from the literature to add to reviewed entries and supplement these in unreviewed entries with annotations provided by automated systems such as the newly implemented Association-Rule-Based Annotator (ARBA). We have developed a credit-based publication submission interface to allow the community to contribute publications and annotations to UniProt entries. We describe how UniProtKB responded to the COVID-19 pandemic through expert curation of relevant entries that were rapidly made available to the research community through a dedicated portal. UniProt resources are available under a CC-BY (4.0) license via the web at https://www.uniprot.org/.

Keywords

UniProtBiologyHuman proteome projectProteomeEnsemblComputer scienceWorld Wide WebComputational biologyBioinformaticsGenomeGenomicsProteomicsGenetics

MeSH Terms

COVID-19Computational BiologyData CurationDatabasesProteinHumansInternetKnowledge BasesMolecular Sequence AnnotationPandemicsProteomeProteomicsSARS-CoV-2User-Computer InterfaceViral Proteins

Related Publications

Publication Info

Year
2020
Type
article
Volume
49
Issue
D1
Pages
D480-D489
Citations
6740
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

6740
OpenAlex
321
Influential
5718
CrossRef

Cite This

Alex Bateman, María Martin, Sandra Orchard et al. (2020). UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Research , 49 (D1) , D480-D489. https://doi.org/10.1093/nar/gkaa1100

Identifiers

DOI
10.1093/nar/gkaa1100
PMID
33237286
PMCID
PMC7778908

Data Quality

Data completeness: 90%