Abstract

We present a new descriptor named signature based on extended valence sequence. The signature of an atom is a canonical representation of the atom's environment up to a predefined height h. The signature of a molecule is a vector of occurrence numbers of atomic signatures. Two QSAR and QSPR models based on signature are compared with models obtained using popular molecular 2D descriptors taken from a commercially available software (Molconn-Z). One set contains the inhibition concentration at 50% for 121 HIV-1 protease inhibitors, while the second set contains 12865 octanol/water partitioning coefficients (Log P). For both data sets, the models created by signature performed comparable to those from the commercially available descriptors in both correlating the data and in predicting test set values not used in the parametrization. While probing signature's QSAR and QSPR performances, we demonstrates that for any given molecule of diameter D, there is a molecular signature of height h </= D+1, from which any 2D descriptor can be computed. As a consequence of this finding any QSAR or QSPR involving 2D descriptors can be replaced with a relationship involving occurrence number of atomic signatures.

Keywords

Quantitative structure–activity relationshipMolecular descriptorSignature (topology)ChemistryTest setPattern recognition (psychology)Computational chemistryMathematicsBiological systemArtificial intelligenceComputer scienceStereochemistry

Affiliated Institutions

Related Publications

Publication Info

Year
2003
Type
article
Volume
43
Issue
3
Pages
707-720
Citations
234
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

234
OpenAlex

Cite This

Jean‐Loup Faulon, Donald P. Visco, Ramdas S. Pophale (2003). The Signature Molecular Descriptor. 1. Using Extended Valence Sequences in QSAR and QSPR Studies. Journal of Chemical Information and Computer Sciences , 43 (3) , 707-720. https://doi.org/10.1021/ci020345w

Identifiers

DOI
10.1021/ci020345w