Classification assessment methods

Alaa Tharwat

doi:10.1016/j.aci.2018.08.003

Abstract

Classification techniques have been applied in many fields of science. There are several ways of evaluating classification algorithms, and these metrics and their significance must be interpreted correctly when comparing different learning algorithms. Most of these measures are scalar metrics, and some are graphical methods. This paper gives a detailed overview of classification assessment measures, covering their basics and showing how they work, so that it can serve as a comprehensive source for researchers interested in this field. The overview starts with the definition of the confusion matrix in binary and multi-class classification problems. Many classification measures are also explained in detail, and the influence of balanced and imbalanced data on each metric is presented. An illustrative example is introduced to show (1) how to calculate these measures in binary and multi-class classification problems, and (2) the robustness of some measures against balanced and imbalanced data. Moreover, graphical measures such as Receiver operating characteristics (ROC), Precision-Recall (PR), and Detection error trade-off (DET) curves are presented in detail. Additionally, numerical examples demonstrate, step by step, the preprocessing needed to plot ROC, PR, and DET curves.
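The point about balanced and imbalanced data can be illustrated with a minimal Python sketch. It is not taken from the paper: the class name `ConfusionMatrix`, the helper `with_negatives_scaled`, and the counts are all hypothetical, chosen only to show which standard confusion-matrix measures change when the negative class becomes ten times larger while the classifier's per-class behaviour stays the same.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionMatrix:
    """Binary confusion matrix counts (hypothetical helper, not from the paper)."""
    tp: int  # positive samples predicted positive
    fn: int  # positive samples predicted negative
    fp: int  # negative samples predicted positive
    tn: int  # negative samples predicted negative

    def accuracy(self) -> float:
        return (self.tp + self.tn) / (self.tp + self.fn + self.fp + self.tn)

    def sensitivity(self) -> float:
        # True positive rate (recall): uses positive samples only.
        return self.tp / (self.tp + self.fn)

    def specificity(self) -> float:
        # True negative rate: uses negative samples only.
        return self.tn / (self.tn + self.fp)

    def precision(self) -> float:
        # Mixes counts from both classes (tp from positives, fp from negatives).
        return self.tp / (self.tp + self.fp)

    def with_negatives_scaled(self, factor: int) -> "ConfusionMatrix":
        # Simulate class imbalance: multiply the negative class by `factor`
        # while keeping the per-class error rates unchanged.
        return ConfusionMatrix(self.tp, self.fn, self.fp * factor, self.tn * factor)


# Assumed, made-up counts: 100 positives and 100 negatives.
balanced = ConfusionMatrix(tp=70, fn=30, fp=20, tn=80)
# Same classifier behaviour, but 1000 negatives instead of 100.
imbalanced = balanced.with_negatives_scaled(10)

for name, cm in (("balanced", balanced), ("imbalanced", imbalanced)):
    print(f"{name:>10}: acc={cm.accuracy():.3f} sens={cm.sensitivity():.3f} "
          f"spec={cm.specificity():.3f} prec={cm.precision():.3f}")
```

In this sketch, sensitivity and specificity stay at 0.7 and 0.8 because each is computed from a single class, whereas accuracy moves from 0.75 to about 0.791 and precision drops from about 0.778 to about 0.259 because both combine counts from the two classes.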

Keywords

Computer science, Preprocessor, Confusion matrix, Metric (unit), Binary classification, Data mining, Robustness (evolution), Binary number, Machine learning, Field (mathematics), Class (philosophy), Artificial intelligence, Algorithm, Support vector machine, Mathematics

Related Publications

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

Davide Chicco, Giuseppe Jurman

Background: To evaluate binary classifications and their confusion matrices, scientific researchers can employ several statistical rates, accordingly to the goal of the ...

2020 BMC Genomics 5067 citations

A systematic study of the class imbalance problem in convolutional neural networks

Mateusz Buda, Atsuto Maki, Maciej A. Mazurowski

In this study, we systematically investigate the impact of class imbalance on classification performance of convolutional neural networks (CNNs) and compare frequently used meth...

2018 Neural Networks 2639 citations

The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets

Takaya Saito, Marc Rehmsmeier

Binary classifiers are routinely evaluated with performance measures such as sensitivity and specificity, and performance is frequently illustrated with Receiver Operating Chara...

2015 PLoS ONE 3921 citations

Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction

Gary M. Weiss, Foster Provost

For large, real-world inductive learning problems, the number of training examples often must be limited due to the costs associated with procuring, preparing, and storing the t...

2003 Journal of Artificial Intelligence Re... 918 citations

Generalized Bradley-Terry Models and Multi-Class Probability Estimates

Tzu-Kuo Huang, Ruby C. Weng, Chih‐Jen Lin

The Bradley-Terry model for obtaining individual skill from paired comparisons has been popular in many areas. In machine learning, this model is related to multi-class probabil...

2006 Journal of Machine Learning Research 151 citations

Publication Info

Year: 2018
Type: article
Volume: 17
Issue: 1
Pages: 168-192
Citations: 2157
Access: Closed

Citation Metrics

OpenAlex: 2157
Influential: 107
CrossRef: 1482

Cite This

APA Style

Alaa Tharwat (2018). Classification assessment methods. Applied Computing and Informatics, 17(1), 168-192. https://doi.org/10.1016/j.aci.2018.08.003

Identifiers

DOI: 10.1016/j.aci.2018.08.003
