Stopping Rules in Principal Components Analysis: A Comparison of Heuristical and Statistical Approaches

1993 Ecology 2,238 citations

Abstract

Approaches to determining the number of components to interpret from principal components analysis were compared. Heuristic procedures included: retaining components with eigenvalues (λ) > 1 (i.e., Kaiser-Guttman criterion); components with bootstrapped λ > 1 (bootstrapped Kaiser-Guttman); the scree plot; the broken-stick model; and components with λ totalling to a fixed amount of the total variance. Statistical approaches included: Bartlett's test of sphericity; Bartlett's test of homogeneity of the correlation matrix; Lawley's test of the second λ; bootstrapped confidence limits on successive λ (i.e., significant differences between λ); and bootstrapped confidence limits on eigenvector coefficients (i.e., coefficients that differ significantly from zero). All methods were compared using simulated data matrices of uniform correlation structure, patterned matrices of varying correlation structure, and data sets of lake morphometry, water chemistry, and benthic invertebrate abundance. The most consistent results were obtained from the broken-stick model and a combined measure using bootstrapped λ and associated eigenvector coefficients. The traditional and bootstrapped Kaiser-Guttman approaches overestimated the number of nontrivial dimensions, as did the fixed-amount-of-variance model. The scree plot consistently estimated one dimension more than the number of simulated dimensions. Bartlett's test of sphericity showed inconsistent results. Both Bartlett's test of homogeneity of the correlation matrix and Lawley's test are limited to testing for only one and two dimensions, respectively.
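
Two of the heuristic rules named in the abstract, the Kaiser-Guttman criterion and the broken-stick model, can be sketched in a few lines. The sketch below is illustrative only and is not taken from the paper: the function names are hypothetical, the eigenvalues are made-up values for a five-variable correlation matrix (so they sum to 5), and the broken-stick expectation is assumed to be the usual form b_k = (1/p) * sum_{i=k}^{p} 1/i, with components retained until the first one that falls below its expectation.

```python
import numpy as np

def broken_stick(p):
    """Expected proportion of variance for each of p components
    under the broken-stick model: b_k = (1/p) * sum_{i=k}^{p} 1/i."""
    inv = 1.0 / np.arange(1, p + 1)
    # Reverse cumulative sum gives the tail sums sum_{i=k}^{p} 1/i.
    return np.cumsum(inv[::-1])[::-1] / p

def kaiser_guttman_count(eigvals):
    """Number of components with eigenvalue > 1 (correlation-matrix PCA)."""
    return int(np.sum(np.asarray(eigvals) > 1.0))

def broken_stick_count(eigvals):
    """Number of leading components whose share of variance exceeds
    the broken-stick expectation, stopping at the first failure."""
    eigvals = np.sort(np.asarray(eigvals, dtype=float))[::-1]
    prop = eigvals / eigvals.sum()
    exceeds = prop > broken_stick(len(eigvals))
    # argmin on a boolean array returns the index of the first False.
    return len(exceeds) if exceeds.all() else int(np.argmin(exceeds))

# Hypothetical eigenvalues (assumption, not data from the paper).
eigvals = np.array([2.6, 1.1, 0.6, 0.4, 0.3])

print("Kaiser-Guttman retains:", kaiser_guttman_count(eigvals))
print("Broken-stick retains:  ", broken_stick_count(eigvals))
```

For these made-up eigenvalues the Kaiser-Guttman rule retains two components and the broken-stick rule retains one, which shows how the two rules can disagree on the same spectrum. It says nothing by itself about which rule is correct for real data.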

Keywords

Mathematics; Statistics; Principal component analysis; Sphericity; Guttman scale; Homogeneity (statistics); Eigenvalues and eigenvectors; Statistical hypothesis testing; Applied mathematics; Geometry

Publication Info

Year: 1993
Type: article
Volume: 74
Issue: 8
Pages: 2204-2214
Citations: 2,238
Access: Closed

Citation Metrics

2,238 (OpenAlex)

Cite This

Donald A. Jackson (1993). Stopping Rules in Principal Components Analysis: A Comparison of Heuristical and Statistical Approaches. Ecology, 74(8), 2204-2214. https://doi.org/10.2307/1939574

Identifiers

DOI: 10.2307/1939574