Abstract

A training set of data has been used to construct a rule for predicting future responses. What is the error rate of this rule? This is an important question both for comparing models and for assessing a final selected model. The traditional answer to this question is given by cross-validation. The cross-validation estimate of prediction error is nearly unbiased but can be highly variable. Here we discuss bootstrap estimates of prediction error, which can be thought of as smoothed versions of cross-validation. We show that a particular bootstrap method, the .632+ rule, substantially outperforms cross-validation in a catalog of 24 simulation experiments. Besides providing point estimates, we also consider estimating the variability of an error rate estimate. All of the results here are nonparametric and apply to any possible prediction rule; however, we study only classification problems with 0–1 loss in detail. Our simulations include "smooth" prediction rules like Fisher's linear discriminant function and unsmooth ones like nearest neighbors.
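
The abstract names the .632+ rule without stating it. As a rough illustration, the sketch below follows the form in which the rule is usually stated for 0–1 loss: the apparent (resubstitution) error and the leave-one-out bootstrap error are blended with a weight that grows with the relative overfitting rate, measured against a no-information error rate. The function names `one_nn_predict` and `err_632_plus`, the number of bootstrap replicates, and the toy data are assumptions made for this example and are not taken from the abstract; the nearest-neighbor rule is used only because the abstract mentions it as an unsmooth rule.

```python
import numpy as np


def one_nn_predict(X_train, y_train, X_test):
    """Hypothetical helper: 1-nearest-neighbor rule with Euclidean distance."""
    d = ((X_test[:, None, :] - X_train[None, :, :]) ** 2).sum(axis=2)
    return y_train[d.argmin(axis=1)]


def err_632_plus(X, y, fit_predict, n_boot=100, seed=0):
    """Sketch of a .632+ style estimate of 0-1 prediction error.

    fit_predict(X_train, y_train, X_test) must return predicted labels.
    """
    rng = np.random.default_rng(seed)
    n = len(y)

    # Apparent error: train and test on the same data.
    pred_all = fit_predict(X, y, X)
    err_bar = np.mean(pred_all != y)

    # Leave-one-out bootstrap error: each point is scored only by
    # bootstrap samples that do not contain it.
    err_sum = np.zeros(n)
    counts = np.zeros(n)
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)
        out = np.setdiff1d(np.arange(n), idx)
        if out.size == 0:
            continue
        pred = fit_predict(X[idx], y[idx], X[out])
        err_sum[out] += pred != y[out]
        counts[out] += 1
    seen = counts > 0
    err1 = np.mean(err_sum[seen] / counts[seen])

    # No-information error rate: error if labels and predictions were independent.
    classes = np.unique(y)
    p = np.array([np.mean(y == c) for c in classes])
    q = np.array([np.mean(pred_all == c) for c in classes])
    gamma = np.sum(p * (1.0 - q))

    # Relative overfitting rate, kept in [0, 1].
    err1_c = min(err1, gamma)
    if err1_c > err_bar and gamma > err_bar:
        R = (err1_c - err_bar) / (gamma - err_bar)
    else:
        R = 0.0

    # Weight moves from .632 (no overfitting) toward 1 (severe overfitting).
    w = 0.632 / (1.0 - 0.368 * R)
    return (1.0 - w) * err_bar + w * err1_c


# Toy data (an assumption for illustration): two overlapping Gaussian classes.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 1.0, (30, 2)), rng.normal(1.5, 1.0, (30, 2))])
y = np.repeat([0, 1], 30)
print(err_632_plus(X, y, one_nn_predict))
```

The nearest-neighbor rule has zero apparent error on its own training set, which is the situation where a fixed .632 weight would be too optimistic; the adaptive weight in this sketch shifts toward the leave-one-out bootstrap error in that case.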

Keywords

Cross-validation, Computer science, Nonparametric statistics, Set (abstract data type), Statistics, Data set, Data mining, Mathematics, Artificial intelligence

Publication Info

Year: 1997
Type: article
Volume: 92
Issue: 438
Pages: 548-560
Citations: 1363
Access: Closed

Cite This

Bradley Efron, Robert Tibshirani (1997). Improvements on Cross-Validation: The .632+ Bootstrap Method. Journal of the American Statistical Association, 92(438), 548-560. https://doi.org/10.1080/01621459.1997.10474007

Identifiers

DOI: 10.1080/01621459.1997.10474007