Abstract
Abstract We construct a prediction rule on the basis of some data, and then wish to estimate the error rate of this rule in classifying future observations. Cross-validation provides a nearly unbiased estimate, using only the original data. Cross-validation turns out to be related closely to the bootstrap estimate of the error rate. This article has two purposes: to understand better the theoretical basis of the prediction problem, and to investigate some related estimators, which seem to offer considerably improved estimation in small samples.
Keywords
Affiliated Institutions
Related Publications
Improvements on Cross-Validation: The 632+ Bootstrap Method
Abstract A training set of data has been used to construct a rule for predicting future responses. What is the error rate of this rule? This is an important question both for co...
Selection bias in gene extraction on the basis of microarray gene-expression data
In the context of cancer diagnosis and treatment, we consider the problem of constructing an accurate prediction rule on the basis of a relatively small number of tumor tissue s...
Summarizing the predictive power of a generalized linear model
This paper studies summary measures of the predictive power of a generalized linear model, paying special attention to a generalization of the multiple correlation coefficient f...
Empirical Functionals and Efficient Smoothing Parameter Selection
SUMMARY A striking feature of curve estimation is that the smoothing parameter ĥ 0, which minimizes the squared error of a kernel or smoothing spline estimator, is very difficul...
How Biased is the Apparent Error Rate of a Prediction Rule?
Abstract A regression model is fitted to an observed set of data. How accurate is the model for predicting future observations? The apparent error rate tends to underestimate th...
Publication Info
- Year
- 1983
- Type
- article
- Volume
- 78
- Issue
- 382
- Pages
- 316-331
- Citations
- 2126
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1080/01621459.1983.10477973