Abstract
In machine learning problems, differences in prior class probabilities -- or class imbalances -- have been reported to hinder the performance of some standard classifiers, such as decision trees. This paper presents a systematic study aimed at answer
Keywords
Affiliated Institutions
Related Publications
Class imbalances versus small disjuncts
It is often assumed that class imbalances are responsible for significant losses of performance in standard classifiers. The purpose of this paper is to the question whether cla...
A study of the behavior of several methods for balancing machine learning training data
There are several aspects that might influence the performance achieved by existing learning systems. It has been reported that one of these aspects is related to class imbalanc...
A systematic study of the class imbalance problem in convolutional neural networks
In this study, we systematically investigate the impact of class imbalance on classification performance of convolutional neural networks (CNNs) and compare frequently used meth...
Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction
For large, real-world inductive learning problems, the number of training examples often must be limited due to the costs associated with procuring, preparing, and storing the t...
Improved boosting algorithms using confidence-rated predictions
We describe several improvements to Freund and Schapireās AdaBoost boosting algorithm, particularly in a setting in which hypotheses may assign confidences to each of their pred...
Publication Info
- Year
- 2002
- Type
- article
- Volume
- 6
- Issue
- 5
- Pages
- 429-449
- Citations
- 3155
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.3233/ida-2002-6504