Highly accurate protein structure prediction with AlphaFold

2021 Nature 39,627 citations

Abstract

Abstract Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort 1–4 , the structures of around 100,000 unique proteins have been determined 5 , but this represents a small fraction of the billions of known protein sequences 6,7 . Structural coverage is bottlenecked by the months to years of painstaking effort required to determine a single protein structure. Accurate computational approaches are needed to address this gap and to enable large-scale structural bioinformatics. Predicting the three-dimensional structure that a protein will adopt based solely on its amino acid sequence—the structure prediction component of the ‘protein folding problem’ 8 —has been an important open research problem for more than 50 years 9 . Despite recent progress 10–14 , existing methods fall far short of atomic accuracy, especially when no homologous structure is available. Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no similar structure is known. We validated an entirely redesigned version of our neural network-based model, AlphaFold, in the challenging 14th Critical Assessment of protein Structure Prediction (CASP14) 15 , demonstrating accuracy competitive with experimental structures in a majority of cases and greatly outperforming other methods. Underpinning the latest version of AlphaFold is a novel machine learning approach that incorporates physical and biological knowledge about protein structure, leveraging multi-sequence alignments, into the design of the deep learning algorithm.

Keywords

Protein structure predictionComputer scienceCASPProtein structureThreading (protein sequence)Artificial intelligenceMachine learningStructural bioinformaticsArtificial neural networkProtein superfamilySequence (biology)Function (biology)Computational biologyBiology

MeSH Terms

Amino Acid SequenceComputational BiologyDatabasesProteinDeep LearningModelsMolecularNeural NetworksComputerProtein ConformationProtein FoldingProteinsReproducibility of ResultsSequence Alignment

Affiliated Institutions

Related Publications

Publication Info

Year
2021
Type
article
Volume
596
Issue
7873
Pages
583-589
Citations
39627
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

39627
OpenAlex
3097
Influential

Cite This

John Jumper, K Taki, Alexander Pritzel et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature , 596 (7873) , 583-589. https://doi.org/10.1038/s41586-021-03819-2

Identifiers

DOI
10.1038/s41586-021-03819-2
PMID
34265844
PMCID
PMC8371605

Data Quality

Data completeness: 86%