Arcing classifier (with discussion and a rejoinder by the author)

Abstract

Recent work has shown that combining multiple versions of unstable\nclassifiers such as trees or neural nets results in reduced test set error. One\nof the more effective is bagging. Here, modified training sets are formed by\nresampling from the original training set, classifiers constructed using these\ntraining sets and then combined by voting. Freund and Schapire propose an\nalgorithm the basis of which is to adaptively resample and combine (hence the\nacronym “arcing”) so that the weights in the resampling are\nincreased for those cases most often misclassified and the combining is done by\nweighted voting. Arcing is more successful than bagging in test set error\nreduction. We explore two arcing algorithms, compare them to each other and to\nbagging, and try to understand how arcing works. We introduce the definitions\nof bias and variance for a classifier as components of the test set error.\nUnstable classifiers can have low bias on a large range of data sets. Their\nproblem is high variance. Combining multiple versions either through bagging or\narcing reduces variance significantly.

Keywords

ResamplingClassifier (UML)MathematicsArtificial intelligencePattern recognition (psychology)AlgorithmTest setVariance (accounting)Machine learningComputer scienceStatistics

Affiliated Institutions

University of California, Berkeley US

Related Publications

Boosting the margin: a new explanation for the effectiveness of voting methods

Peter L. Bartlett , Yoav Freund , Wee Sun Lee +1 more

One of the surprising recurring phenomena observed in experiments\nwith boosting is that the test error of the generated classifier usually does\nnot increase as its size become...

1998 The Annals of Statistics 2310 citations

Bootstrap Techniques for Error Estimation

Anil K. Jain , Richard C. Dubes , Chien-Chang Chen

The design of a pattern recognition system requires careful attention to error estimation. The error rate is the most important descriptor of a classifier's performance. The com...

1987 IEEE Transactions on Pattern Analysis... 198 citations

Boosting the margin: A new explanation for the effectiveness of voting methods

Robert E. Schapire , Yoav Freund , Peter Barlett +1 more

One of the surprising recurring phenomena observed in experiments with boosting is that the test error of the generated classifier usually does not increase as its size becomes ...

1997 QUT ePrints (Queensland University of... 578 citations

Publication Info

Year: 1998
Type: article
Volume: 26
Issue: 3
Citations: 1088
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Arcing classifier (with discussion and a rejoinder by the author)

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

1088

OpenAlex

Cite This

APA Style

                            
                                    Leo Breiman
                                
                            (1998). 
                            Arcing classifier (with discussion and a rejoinder by the author). 
                            The Annals of Statistics
                            , 26
                            (3)
                            .
                            https://doi.org/10.1214/aos/1024691079

Identifiers

DOI: 10.1214/aos/1024691079