A Deep Learning Approach to Antibiotic Discovery

2020 Cell 1,900 citations

Abstract

Due to the rapid emergence of antibiotic-resistant bacteria, there is a growing need to discover new antibiotics. To address this challenge, we trained a deep neural network capable of predicting molecules with antibacterial activity. We performed predictions on multiple chemical libraries and discovered a molecule from the Drug Repurposing Hub-halicin-that is structurally divergent from conventional antibiotics and displays bactericidal activity against a wide phylogenetic spectrum of pathogens including Mycobacterium tuberculosis and carbapenem-resistant Enterobacteriaceae. Halicin also effectively treated Clostridioides difficile and pan-resistant Acinetobacter baumannii infections in murine models. Additionally, from a discrete set of 23 empirically tested predictions from >107 million molecules curated from the ZINC15 database, our model identified eight antibacterial compounds that are structurally distant from known antibiotics. This work highlights the utility of deep learning approaches to expand our antibiotic arsenal through the discovery of structurally distinct antibacterial molecules.

Keywords

BiologyDrug discoveryComputational biologyAntibioticsBioinformaticsMicrobiology

MeSH Terms

Acinetobacter baumanniiAnimalsAnti-Bacterial AgentsCheminformaticsClostridioides difficileDatabasesChemicalDrug DiscoveryMachine LearningMiceMiceInbred BALB CMiceInbred C57BLMycobacterium tuberculosisSmall Molecule LibrariesThiadiazoles

Affiliated Institutions

Related Publications

Publication Info

Year
2020
Type
article
Volume
180
Issue
4
Pages
688-702.e13
Citations
1900
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1900
OpenAlex
22
Influential
1625
CrossRef

Cite This

Jonathan Stokes, Kevin Yang, Kyle Swanson et al. (2020). A Deep Learning Approach to Antibiotic Discovery. Cell , 180 (4) , 688-702.e13. https://doi.org/10.1016/j.cell.2020.01.021

Identifiers

DOI
10.1016/j.cell.2020.01.021
PMID
32084340
PMCID
PMC8349178

Data Quality

Data completeness: 90%