Abstract

Eleven popular scoring functions have been tested on 100 protein-ligand complexes to evaluate their abilities to reproduce experimentally determined structures and binding affinities. They include four scoring functions implemented in the LigFit module in Cerius2 (LigScore, PLP, PMF, and LUDI), four scoring functions implemented in the CScore module in SYBYL (F-Score, G-Score, D-Score, and ChemScore), the scoring function implemented in the AutoDock program, and two stand-alone scoring functions (DrugScore and X-Score). These scoring functions are not tested in the context of a particular docking program. Instead, conformational sampling and scoring are separated into two consecutive steps. First, an exhaustive conformational sampling is performed by using the AutoDock program to generate an ensemble of docked conformations for each ligand molecule. This conformational ensemble is required to cover the entire conformational space as much as possible rather than to focus on a few energy minima. Then, each scoring function is applied to score this conformational ensemble to see if it can identify the experimentally observed conformation from all of the other decoys. Among all of the scoring functions under test, six of them, i.e., PLP, F-Score, LigScore, DrugScore, LUDI, and X-Score, yield success rates higher than the AutoDock scoring function. The success rates of these six scoring functions range from 66% to 76% if using root-mean-square deviation < or =2.0 A as the criterion. Combining any two or three of these six scoring functions into a consensus scoring scheme further improves the success rate to nearly 80% or even higher. However, when applied to reproduce the experimentally determined binding affinities of the 100 protein-ligand complexes, only X-Score, PLP, DrugScore, and G-Score are able to give correlation coefficients over 0.50. All of the 11 scoring functions are further inspected by their abilities to construct a descriptive, funnel-shaped energy surface for protein-ligand complexation. The results indicate that X-Score and DrugScore perform better than the other ones at this aspect.

Keywords

AutoDockDocking (animal)Scoring ruleScoring systemChemistryArtificial intelligenceComputer scienceMachine learningIn silicoBiochemistryMedicine

Affiliated Institutions

Related Publications

Publication Info

Year
2003
Type
article
Volume
46
Issue
12
Pages
2287-2303
Citations
842
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

842
OpenAlex

Cite This

Renxiao Wang, Yipin Lu, Shaomeng Wang (2003). Comparative Evaluation of 11 Scoring Functions for Molecular Docking. Journal of Medicinal Chemistry , 46 (12) , 2287-2303. https://doi.org/10.1021/jm0203783

Identifiers

DOI
10.1021/jm0203783