Abstract

Boosted decision trees are one of the most popular and successful learning techniques used today. While exhibiting fast speeds at test time, relatively slow training makes them impractical for applications with real-time learning requirements. We propose a principled approach to overcome this drawback. We prove a bound on the error of a decision stump given its preliminary error on a subset of the training data; the bound may be used to prune unpromising features early on in the training process. We propose a fast training algorithm that exploits this bound, yielding speedups of an order of magnitude at no cost in the final performance of the classifier. Our method is not a new variant of Boosting; rather, it may be used in conjunction with existing Boosting algorithms and other sampling heuristics to achieve even greater speedups.

Keywords

Boosting (machine learning)Computer scienceExploitMachine learningArtificial intelligenceDecision treeGradient boostingPruningClassifier (UML)Training setRandom forest

Affiliated Institutions

Related Publications

Publication Info

Year
2013
Type
article
Pages
594-602
Citations
123
Access
Closed

External Links

Citation Metrics

123
OpenAlex

Cite This

Ron D. Appel, Thomas J. Fuchs, Piotr Dollár et al. (2013). Quickly Boosting Decision Trees - Pruning Underachieving Features Early. The Caltech Institute Archives (California Institute of Technology) , 594-602.