Abstract
In many multivariate situations we are presented with more variables than we would like, and the question arises whether they are all necessary and if not which can be discarded. In this paper we consider two such situations. (a)Regression analysis. The problem here is whether any variables can be discarded as adding little or nothing to the accuracy with which the regression equation correlates with the dependent variable. (b)Interdependence analysis. The problem is whether a constellation in p dimonsions collapses, exactly or approximately, into fewer dimensions, and if so whether any of the original variables can be discarded. We may define the best solution to (a) using any given number of variables as the one that maximizes the multiple correlation between the selected variables and the dependent variable, and similarly for (b) as the one that maximizes the smallest multiple correlation with any of the rejected variables. In practice it is usual to accept an approximate solution to (a) based on 'step-wise' multiple regression: we know of no standard program for (b). We have developed cut-off rules that enable us to find the best solution to both problems by partial enumeration. The paper discusses the details of this approach, and computational experience.
Keywords
Related Publications
Multiple Regression in Behavioral Research: Explanation and Prediction
Part I: Foundations of Multiple Regression Analysis. Overview. Simple Linear Regression and Correlation. Regression Diagnostics. Computers and Computer Programs. Elements of Mul...
Why Stepdown Procedures in Variable Selection
Recent reviews have dealt with the subject of which variables to select and which to discard in multiple regression problems. Lindley (1968) emphasized that the method to be emp...
Applied Regression Analysis and Other Multivariable Methods
1. CONCEPTS AND EXAMPLES OF RESEARCH. Concepts. Examples. Concluding Remarks. References. 2. CLASSIFICATION OF VARIABLES AND THE CHOICE OF ANALYSIS. Classification of Variables....
Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences
Contents: Preface. Introduction. Bivariate Correlation and Regression. Multiple Regression/Correlation With Two or More Independent Variables. Data Visualization, Exploration, a...
Detecting Multicollinearity in Regression Analysis
Multicollinearity occurs when the multiple linear regression analysis includes several variables that are significantly correlated not only with the dependent variable but also ...
Publication Info
- Year
- 1967
- Type
- article
- Volume
- 54
- Issue
- 3-4
- Pages
- 357-366
- Citations
- 242
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1093/biomet/54.3-4.357