Abstract
In an earlier paper, we introduced a new "boosting" algorithm called AdaBoost which, theoretically, can be used to significantly reduce the error of any learning algorithm that consistently generates classifiers whose performance is a little better than random guessing. We also introduced the related notion of a "pseudo-loss," which is a method for forcing a learning algorithm of multi-label concepts to concentrate on the labels that are hardest to discriminate. In this paper, we describe experiments we carried out to assess how well AdaBoost, with and without pseudo-loss, performs on real learning problems. We performed two sets of experiments. The first set compared boosting to Breiman's "bagging" method when used to aggregate various classifiers (including decision trees and single attribute-value tests). We compared the performance of the two methods on a collection of machine-learning benchmarks. In the second set of experiments, we studied in more detail the performance of boosting...
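The boosting idea summarized above, repeatedly reweighting the training set so that later weak classifiers focus on the examples earlier ones got wrong, then aggregating by weighted vote, can be sketched as follows. This is a minimal illustration using single attribute-value tests (decision stumps) as the weak learner, not a reproduction of the paper's experimental setup; all function names here are hypothetical.

```python
import numpy as np

def train_stump(X, y, w):
    """Pick the single attribute-value threshold test with lowest weighted error."""
    best, best_err = None, np.inf
    n, d = X.shape
    for j in range(d):
        for thresh in np.unique(X[:, j]):
            for sign in (1, -1):
                pred = sign * np.where(X[:, j] <= thresh, 1, -1)
                err = np.sum(w * (pred != y))
                if err < best_err:
                    best, best_err = (j, thresh, sign), err
    return best, best_err

def stump_predict(stump, X):
    j, thresh, sign = stump
    return sign * np.where(X[:, j] <= thresh, 1, -1)

def adaboost(X, y, T=10):
    """Binary AdaBoost with labels in {-1, +1}."""
    n = len(y)
    w = np.ones(n) / n                        # start with uniform weights
    ensemble = []
    for _ in range(T):
        stump, err = train_stump(X, y, w)
        err = max(err, 1e-10)                 # guard against division by zero
        alpha = 0.5 * np.log((1 - err) / err) # weak classifier's vote weight
        pred = stump_predict(stump, X)
        w *= np.exp(-alpha * y * pred)        # up-weight the misclassified examples
        w /= w.sum()
        ensemble.append((alpha, stump))
    return ensemble

def predict(ensemble, X):
    """Aggregate the weak classifiers by weighted majority vote."""
    agg = sum(alpha * stump_predict(stump, X) for alpha, stump in ensemble)
    return np.sign(agg)
```

The key difference from bagging, which the paper's first set of experiments probes, is that bagging draws each training set by uniform resampling and votes equally, whereas boosting adapts both the example weights and the vote weights to the errors observed so far.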
Related Publications
Using output codes to boost multiclass learning problems
This paper describes a new technique for solving multiclass learning problems by combining Freund and Schapire's boosting algorithm with the main ideas of Dietterich an...
Machine Learning Bias, Statistical Bias, and Statistical Variance of Decision Tree Algorithms
The term "bias" is widely used---and with different meanings---in the fields of machine learning and statistics. This paper clarifies the uses of this term and...
Generalized Boosted Models: A guide to the gbm package
Boosting takes on various forms with different programs using different loss functions, different base models, and different optimization schemes. The gbm package takes the appr...
Bagging, boosting, and C4.5
Breiman's bagging and Freund and Schapire's boosting are recent methods for improving the predictive power of classifier learning systems. Both form a set of classifiers that ar...
A Communication-Efficient Parallel Algorithm for Decision Tree
Decision tree (and its extensions such as Gradient Boosting Decision Trees and Random Forest) is a widely used machine learning algorithm, due to its practical effectiveness and...
Publication Info
- Year: 1996
- Type: article
- Pages: 148-156
- Citations: 7561
- Access: Closed