Robust deep learning–based protein sequence design using ProteinMPNN

2022 Science 1,362 citations

Abstract

Although deep learning has revolutionized protein structure prediction, almost all experimentally characterized de novo protein designs have been generated using physically based approaches such as Rosetta. Here, we describe a deep learning–based protein sequence design method, ProteinMPNN, that has outstanding performance in both in silico and experimental tests. On native protein backbones, ProteinMPNN has a sequence recovery of 52.4% compared with 32.9% for Rosetta. The amino acid sequence at different positions can be coupled between single or multiple chains, enabling application to a wide range of current protein design challenges. We demonstrate the broad utility and high accuracy of ProteinMPNN using x-ray crystallography, cryo–electron microscopy, and functional studies by rescuing previously failed designs, which were made using Rosetta or AlphaFold, of protein monomers, cyclic homo-oligomers, tetrahedral nanoparticles, and target-binding proteins.

Keywords

Protein designIn silicoDeep learningSequence (biology)Protein sequencingComputational biologyPeptide sequenceComputer scienceProtein structure predictionProtein structureArtificial intelligenceChemistryBiophysicsBiochemistryBiologyGene

Affiliated Institutions

Related Publications

Publication Info

Year
2022
Type
article
Volume
378
Issue
6615
Pages
49-56
Citations
1362
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1362
OpenAlex

Cite This

Justas Dauparas, Ivan Anishchenko, Nathaniel R. Bennett et al. (2022). Robust deep learning–based protein sequence design using ProteinMPNN. Science , 378 (6615) , 49-56. https://doi.org/10.1126/science.add2187

Identifiers

DOI
10.1126/science.add2187