Abstract

The effects of sample size on feature selection and error estimation for several types of classifiers are discussed. The focus is on the two-class problem. Classifier design in the context of small design sample size is explored. The estimation of error rates under small test sample size is given. Sample size effects in feature selection are discussed. Recommendations for the choice of learning and test sample sizes are given. In addition to surveying prior work in this area, an emphasis is placed on giving practical advice to designers and users of statistical pattern recognition systems.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">&gt;</ETX>

Keywords

Sample size determinationComputer scienceClassifier (UML)Feature selectionArtificial intelligenceSample (material)Pattern recognition (psychology)Machine learningFeature (linguistics)Statistical hypothesis testingStatisticsData miningMathematics

Affiliated Institutions

Related Publications

Good features to track

No feature-based vision system can work unless good features can be identified and tracked from frame to frame. Although tracking itself is by and large a solved problem, select...

1994 6842 citations

Publication Info

Year
1991
Type
article
Volume
13
Issue
3
Pages
252-264
Citations
1364
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1364
OpenAlex

Cite This

Šarūnas Raudys, Anil K. Jain (1991). Small sample size effects in statistical pattern recognition: recommendations for practitioners. IEEE Transactions on Pattern Analysis and Machine Intelligence , 13 (3) , 252-264. https://doi.org/10.1109/34.75512

Identifiers

DOI
10.1109/34.75512