Abstract
Abstract The frequency of chance correlation using partial least squares (PLS) has been measured experimentally for variously dimensioned data, comprising either completely random numbers, random numbers containing a perfect correlation within, and CoMFA field descriptors. This frequency, much lower than that for stepwise multiple regression, is maximal for datasets in which the number of descriptors equals the number of compounds, and surprisingly decreases indefinitely as the number of descriptors becomes much greater than the number of compounds. However, perfect correlations involving descriptor subsets are not detected by PLS if the number of irrelevant descriptors is excessive. In CoMFA applications, the probability of chance correlation is usually negligible. For example with 21 compounds a crossvalidated r 2 value greater than 0.25 will occur by chance in less than 5% of trials.
Keywords
Related Publications
A test of significance for partial least squares regression
Abstract Partial least squares (PLS) regression is a commonly used statistical technique for performing multivariate calibration, especially in situations where there are more v...
Classification Using Generalized Partial Least Squares
AbstractAdvances in computational biology have made simultaneous monitoring of thousands of features possible. The high throughput technologies not only bring about a much riche...
Partial least squares for discrimination
Abstract Partial least squares (PLS) was not originally designed as a tool for statistical discrimination. In spite of this, applied scientists routinely use PLS for classificat...
Prediction intervals in partial least squares
Partial least squares (PLS) regression has become a popular technique within the chemometric community, particularly for dealing with calibration problems. An important aspect o...
Partial least squares regression and projection on latent structure regression (PLS Regression)
Abstract Partial least squares (PLS) regression ( a.k.a. projection on latent structures) is a recent technique that combines features from and generalizes principal component a...
Publication Info
- Year
- 1993
- Type
- article
- Volume
- 12
- Issue
- 2
- Pages
- 137-145
- Citations
- 296
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1002/qsar.19930120205