Classifier Technology and the Illusion of Progress

Abstract

A great many tools have been developed for supervised classification, ranging from early methods such as linear discriminant analysis through to modern developments such as neural networks and support vector machines. A large number of comparative studies have been conducted in attempts to establish the relative superiority of these methods. This paper argues that these comparisons often fail to take into account important aspects of real problems, so that the apparent superiority of more sophisticated methods may be something of an illusion. In particular, simple methods typically yield performance almost as good as more sophisticated methods, to the extent that the difference in performance may be swamped by other sources of uncertainty that generally are not considered in the classical supervised classification paradigm.

Keywords

IllusionLinear discriminant analysisComputer scienceMachine learningArtificial intelligenceClassifier (UML)Support vector machineArtificial neural networkRangingSimple (philosophy)Pattern recognition (psychology)Cognitive psychologyEpistemologyPsychology

Related Publications

Semi-Supervised Classification of Network Data Using Very Few Labels

Frank Lin , William W. Cohen

The goal of semi-supervised learning (SSL) methods is to reduce the amount of labeled training data required by learning from both labeled and unlabeled instances. Macskassy and...

2010 93 citations

Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy

Hanchuan Peng , Fuhui Long , Chen Ding

Feature selection is an important problem for pattern classification systems. We study how to select good features according to the maximal statistical dependency criterion base...

2005 IEEE Transactions on Pattern Analysis... 10050 citations

Advances in kernel methods: support vector learning

Bernhard Schölkopf , Christopher J. C. Burges , Alexander J. Smola

Introduction to support vector learning roadmap. Part 1 Theory: three remarks on the support vector method of function estimation, Vladimir Vapnik generalization performance of ...

1999 International Conference on Neural In... 5814 citations

The random subspace method for constructing decision forests

Tin Kam Ho

Much of previous attention on decision trees focuses on the splitting criteria and optimization of tree sizes. The dilemma between overfitting and achieving maximum accuracy is ...

1998 IEEE Transactions on Pattern Analysis... 6677 citations

Binarized Support Vector Machines

Emilio Carrizosa , Belén Martín-Barragán , Dolores Romero Morales

The widely used support vector machine (SVM) method has shown to yield very good results in supervised classification problems. Other methods such as classification trees have b...

2009 INFORMS journal on computing 36 citations

Publication Info

Year: 2006
Type: article
Volume: 21
Issue: 1
Citations: 675
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Classifier Technology and the Illusion of Progress

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

675

OpenAlex

Cite This

APA Style

                            
                                    David J. Hand
                                
                            (2006). 
                            Classifier Technology and the Illusion of Progress. 
                            Statistical Science
                            , 21
                            (1)
                            .
                            https://doi.org/10.1214/088342306000000060

Identifiers

DOI: 10.1214/088342306000000060