An overtraining-resistant stochastic modeling method for pattern recognition

1996 The Annals of Statistics 135 citations

Abstract

We will introduce a generic approach for solving problems in pattern recognition based on the synthesis of accurate multiclass discriminators from large numbers of very inaccurate "weak" models through the use of discrete stochastic processes. Contrary to the standard expectation held for the many statistical and heuristic techniques normally associated with the field, a significant feature of this method of "stochastic modeling" is its resistance to so-called "overtraining." The drop in performance of any stochastic model in going from training to test data remains comparable to that of the component weak models from which it is synthesized; and since these component models are very simple, their performance drop is small, resulting in a stochastic model whose performance drop is also small despite its high level of accuracy.

Keywords

OvertrainingStochastic modellingMathematicsHeuristicComponent (thermodynamics)Machine learningArtificial intelligenceStochastic processFeature (linguistics)Mathematical optimizationAlgorithmComputer sciencePattern recognition (psychology)Statistics

Affiliated Institutions

Related Publications

An Overview of Overfitting and its Solutions

Overfitting is a fundamental issue in supervised machine learning which prevents us from perfectly generalizing the models to well fit observed data on training data, as well as...

2019 Journal of Physics Conference Series 2055 citations

Publication Info

Year
1996
Type
article
Volume
24
Issue
6
Citations
135
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

135
OpenAlex

Cite This

E. M. Kleinberg (1996). An overtraining-resistant stochastic modeling method for pattern recognition. The Annals of Statistics , 24 (6) . https://doi.org/10.1214/aos/1032181157

Identifiers

DOI
10.1214/aos/1032181157