Abstract
Tree boosting is a highly effective and widely used machine learning method. In this paper, we describe a scalable end-to-end tree boosting system called XGBoost, which is used widely by data scientists to achieve state-of-the-art results on many machine learning challenges. We propose a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning. More importantly, we provide insights on cache access patterns, data compression and sharding to build a scalable tree boosting system. By combining these insights, XGBoost scales beyond billions of examples using far fewer resources than existing systems.
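For context, the system described in the paper is available as the open-source XGBoost library. The sketch below shows a minimal training run with its Python API; the synthetic data and parameter values are illustrative assumptions, not taken from the paper. The `tree_method="approx"` setting selects the approximate split-finding path, which is the part of the library that relies on quantile-sketch machinery of the kind the abstract mentions.

```python
# Minimal sketch of training a gradient boosted tree model with the
# XGBoost Python package. Data and hyperparameters are illustrative.
import numpy as np
import xgboost as xgb

rng = np.random.default_rng(0)
X = rng.standard_normal((1000, 20))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)

# DMatrix is XGBoost's internal data container; it also accepts sparse (CSR) input.
dtrain = xgb.DMatrix(X[:800], label=y[:800])
dvalid = xgb.DMatrix(X[800:], label=y[800:])

params = {
    "objective": "binary:logistic",
    "max_depth": 6,
    "eta": 0.3,
    "tree_method": "approx",  # approximate split finding
}

# Train 50 boosting rounds, reporting validation loss every 10 rounds.
bst = xgb.train(params, dtrain, num_boost_round=50,
                evals=[(dvalid, "valid")], verbose_eval=10)
preds = bst.predict(dvalid)
```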
Related Publications
A Communication-Efficient Parallel Algorithm for Decision Tree
The decision tree (and its extensions such as Gradient Boosting Decision Trees and Random Forests) is a widely used machine learning algorithm, due to its practical effectiveness and...
Publication Info
- Year: 2016
- Type: article
- Pages: 785-794
- Citations: 41264
- Access: Closed
Identifiers
- DOI: 10.1145/2939672.2939785