Abstract
Abstract Regression of the experimental data of one independent variable, y vs . a linear combination of functions of an independent variable of the form y = Σβ j f j (x) is considered. Inherent collinearity among the terms of such functions may prevent obtaining a model of a desired accuracy. Traditional collinearity indicators, condition number of the normal matrix, variance inflation factor, and a new indicator (truncation‐error‐to‐noise ratio) are used to investigate the effects of the range and precision of the independent‐variable data on collinearity among functions in a regression model. Statistical confidence intervals are used to demonstrate harmful effects of collinearity. The harmful effects increase by reducing the range of the independent variable data and/or its precision. Using only independent variable data, the new collinearity indicator allows the identification of the point where the number of terms in a particular regression model becomes larger than can be justified on statistical grounds. The use of the new criterion can improve experimental design in order to minimize the harmful effects of collinearity and enable a rapid screen of correlations published in the literature for identifying those that include more parameters than can be justified.
Keywords
Affiliated Institutions
Related Publications
Generalized Collinearity Diagnostics
Abstract Working in the context of the linear model y = Xβ + ε, we generalize the concept of variance inflation as a measure of collinearity to a subset of parameters in β (deno...
Test for harmful collinearity among predictor variables used in modeling global temperature
CR Climate Research Contact the journal Facebook Twitter RSS Mailing List Subscribe to our mailing list via Mailchimp HomeLatest VolumeAbout the JournalEditorsSpecials CR 24:15-...
Collinearity: a review of methods to deal with it and a simulation study evaluating their performance
Collinearity refers to the non independence of predictor variables, usually in a regression‐type analysis. It is a common feature of any descriptive ecological data set and can ...
MCMC Methods for Multi-Response Generalized Linear Mixed Models: The<b>MCMCglmm</b><i>R</i>Package
Generalized linear mixed models provide a flexible framework for modeling a range of data, although with non-Gaussian response variables the likelihood cannot be obtained in clo...
CONFRONTING MULTICOLLINEARITY IN ECOLOGICAL MULTIPLE REGRESSION
The natural complexity of ecological communities regularly lures ecologists to collect elaborate data sets in which confounding factors are often present. Although multiple regr...
Publication Info
- Year
- 1998
- Type
- article
- Volume
- 44
- Issue
- 3
- Pages
- 603-611
- Citations
- 56
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1002/aic.690440311