Abstract

The natural complexity of ecological communities regularly lures ecologists to collect elaborate data sets in which confounding factors are often present. Although multiple regression is commonly used in such cases to test the individual effects of many explanatory variables on a continuous response, the inherent collinearity (multicollinearity) of confounded explanatory variables encumbers analyses and threatens their statistical and inferential interpretation. Using numerical simulations, I quantified the impact of multicollinearity on ecological multiple regression and found that even low levels of collinearity bias analyses (r ≥ 0.28 or r2 ≥ 0.08), causing (1) inaccurate model parameterization, (2) decreased statistical power, and (3) exclusion of significant predictor variables during model creation. Then, using real ecological data, I demonstrated the utility of various statistical techniques for enhancing the reliability and interpretation of ecological multiple regression in the presence of multicollinearity.

Keywords

MulticollinearityCollinearityStatisticsRegression analysisVariance inflation factorEcologyConfoundingLinear regressionEconometricsRegressionRegression diagnosticMathematicsBiologyPolynomial regression

Affiliated Institutions

Related Publications

Multicollinearity

Abstract Multicollinearity refers to the linear relation among two or more variables. It is a data problem which may cause serious difficulty with the reliability of the estimat...

2010 Wiley Interdisciplinary Reviews Compu... 840 citations

Publication Info

Year
2003
Type
article
Volume
84
Issue
11
Pages
2809-2815
Citations
2447
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

2447
OpenAlex

Cite This

Michael H. Graham (2003). CONFRONTING MULTICOLLINEARITY IN ECOLOGICAL MULTIPLE REGRESSION. Ecology , 84 (11) , 2809-2815. https://doi.org/10.1890/02-3114

Identifiers

DOI
10.1890/02-3114