Solving Multiclass Learning Problems via Error-Correcting Output Codes

Tom Dietterich; Ghulum Bakiri

doi:10.1613/jair.105

Abstract

Multiclass learning problems involve finding a definitionfor an unknown function f(x) whose range is a discrete setcontaining k > 2 values (i.e., k ``classes''). Thedefinition is acquired by studying collections of training examples ofthe form [x_i, f (x_i)]. Existing approaches tomulticlass learning problems include direct application of multiclassalgorithms such as the decision-tree algorithms C4.5 and CART,application of binary concept learning algorithms to learn individualbinary functions for each of the k classes, and application ofbinary concept learning algorithms with distributed outputrepresentations. This paper compares these three approaches to a newtechnique in which error-correcting codes are employed as adistributed output representation. We show that these outputrepresentations improve the generalization performance of both C4.5and backpropagation on a wide range of multiclass learning tasks. Wealso demonstrate that this approach is robust with respect to changesin the size of the training sample, the assignment of distributedrepresentations to particular classes, and the application ofoverfitting avoidance techniques such as decision-tree pruning.Finally, we show that---like the other methods---the error-correctingcode technique can provide reliable class probability estimates.Taken together, these results demonstrate that error-correcting outputcodes provide a general-purpose method for improving the performanceof inductive learning programs on multiclass problems.

Keywords

Multiclass classificationPruningComputer scienceGeneralizationArtificial intelligenceMachine learningRange (aeronautics)Class (philosophy)Decision treeProbably approximately correct learningRepresentation (politics)Binary numberGeneralization errorAlgorithmArtificial neural networkMathematicsArithmetic

Affiliated Institutions

Related Publications

Using output codes to boost multiclass learning problems

Robert E. Schapire

This paper describes a new technique for solving multiclass learning problems by combining Freund and Schapire&apos;s boosting algorithm with the main ideas of Dietterich an...

1997 263 citations

Generalized Bradley-Terry Models and Multi-Class Probability Estimates

Tzu-Kuo Huang , Ruby C. Weng , Chih‐Jen Lin

The Bradley-Terry model for obtaining individual skill from paired comparisons has been popular in many areas. In machine learning, this model is related to multi-class probabil...

2006 Journal of Machine Learning Research 151 citations

Improved boosting algorithms using confidence-rated predictions

Robert E. Schapire , Yoram Singer

We describe several improvements to Freund and Schapire‘s AdaBoost boosting algorithm, particularly in a setting in which hypotheses may assign confidences to each of their pred...

1998 2556 citations

Good quantum error-correcting codes exist

A.R. Calderbank , Peter W. Shor

A quantum error-correcting code is defined to be a unitary mapping (encoding)\nof k qubits (2-state quantum systems) into a subspace of the quantum state\nspace of n qubits such...

1996 Physical Review A 2414 citations

Best-first Decision Tree Learning

Haijian Shi

Decision trees are potentially powerful predictors and explicitly represent the structure of a dataset. Standard decision tree learners such as C4.5 expand nodes in depth-first ...

2007 Research Commons (University of Waikato) 229 citations

Publication Info

Year: 1995
Type: article
Volume: 2
Pages: 263-286
Citations: 2687
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Solving Multiclass Learning Problems via Error-Correcting Output Codes

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

2687

OpenAlex

Cite This

APA Style

                            
                                    Tom Dietterich, 
                                
                                    Ghulum Bakiri
                                
                            (1995). 
                            Solving Multiclass Learning Problems via Error-Correcting Output Codes. 
                            Journal of Artificial Intelligence Research
                            , 2
                            
                            , 263-286.
                            https://doi.org/10.1613/jair.105

Identifiers

DOI: 10.1613/jair.105