Abstract
We have used a Bayesian neural network to distinguish between drugs and nondrugs. For this purpose, the Comprehensive Medicinal Chemistry (CMC) database acts as a surrogate for drug-like molecules, while the Available Chemicals Directory (ACD) is a surrogate for nondrug-like molecules. The task is performed using two different sets of 1D and 2D parameters. The 1D parameters contain information about the entire molecule, such as the molecular weight, and the 2D parameters contain information about specific functional groups within the molecule. Our best models correctly classify over 90% of the compounds in the CMC while classifying about 10% of the molecules in the ACD as drug-like. The models generalize well: roughly 80% of the molecules in the MACCS-II Drug Data Report (MDDR) are classified as drug-like. We propose to use the models to design combinatorial libraries. In a computer experiment on generating a drug-like library of size 100 from a set of 10 000 molecules, we obtain an improvement of at least 3 to 4 orders of magnitude over random methods. The neighborhoods defined by our models are not similar to those generated by standard Tanimoto similarity calculations. The models therefore generate new and different information, which can supplement standard diversity approaches to library design.
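The two ideas the abstract contrasts, ranking molecules by a model's drug-likeness score and comparing them by Tanimoto similarity, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the molecule ids, the fingerprints (sets of on-bit indices), and the scores are invented, and `select_library` is a hypothetical helper that simply keeps the top-scoring molecules. It is not the authors' network or their selection procedure.

```python
import heapq
from typing import Dict, FrozenSet, List


def tanimoto(a: FrozenSet[int], b: FrozenSet[int]) -> float:
    """Tanimoto coefficient of two fingerprints given as sets of on-bit indices."""
    if not a and not b:
        # Convention chosen for this sketch: two empty fingerprints are identical.
        return 1.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def select_library(scores: Dict[str, float], size: int) -> List[str]:
    """Hypothetical helper: ids of the `size` molecules with the highest score."""
    return heapq.nlargest(size, scores, key=scores.__getitem__)


# Toy data, invented for illustration only.
fingerprints = {
    "mol_a": frozenset({1, 2, 3, 4}),
    "mol_b": frozenset({3, 4, 5, 6}),
    "mol_c": frozenset({7, 8}),
}
# Stand-in for the drug-likeness probability a trained classifier would output.
scores = {"mol_a": 0.92, "mol_b": 0.15, "mol_c": 0.88}

library = select_library(scores, size=2)
similarity = tanimoto(fingerprints["mol_a"], fingerprints["mol_b"])
```

In this toy data the two top-scoring molecules, `mol_a` and `mol_c`, share no fingerprint bits, while `mol_a` and the low-scoring `mol_b` share half of theirs. That is the kind of disagreement between score-based and Tanimoto-based neighborhoods the abstract describes.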
Related Publications
A Large Descriptor Set and a Probabilistic Kernel-Based Classifier Significantly Improve Druglikeness Classification
A probabilistic support vector machine (SVM) in combination with ECFP_4 (Extended Connectivity Fingerprints) was applied to establish a druglikeness filter for molecules. Here, t...
Molecular Similarity Based on DOCK-Generated Fingerprints
An alternative method for defining molecular similarity is presented. By using the docking program DOCK and a reference panel of protein binding sites, fingerprints for a set of...
Uni-Mol: A Universal 3D Molecular Representation Learning Framework
Molecular representation learning (MRL) has gained tremendous attention due to its critical role in learning from limited supervised data for applications like drug design. In m...
Bridging Chemical and Biological Space: “Target Fishing” Using 2D and 3D Molecular Descriptors
Bridging chemical and biological space is the key to drug discovery and development. Typically, cheminformatics methods operate under the assumption that similar chemicals have ...
Scaffold Hopping through Virtual Screening Using 2D and 3D Similarity Descriptors: Ranking, Voting, and Consensus Scoring
The ability to find novel bioactive scaffolds in compound similarity-based virtual screening experiments has been studied comparing Tanimoto-based, ranking-based, voting, and co...
Publication Info
- Year: 1998
- Type: article
- Volume: 41
- Issue: 18
- Pages: 3314-3324
- Citations: 478
- Access: Closed
Identifiers
- DOI: 10.1021/jm970666c