Abstract

Classification techniques have been applied to many applications in various fields of sciences. There are several ways of evaluating classification algorithms. The analysis of such metrics and its significance must be interpreted correctly for evaluating different learning algorithms. Most of these measures are scalar metrics and some of them are graphical methods. This paper introduces a detailed overview of the classification assessment measures with the aim of providing the basics of these measures and to show how it works to serve as a comprehensive source for researchers who are interested in this field. This overview starts by highlighting the definition of the confusion matrix in binary and multi-class classification problems. Many classification measures are also explained in details, and the influence of balanced and imbalanced data on each metric is presented. An illustrative example is introduced to show (1) how to calculate these measures in binary and multi-class classification problems, and (2) the robustness of some measures against balanced and imbalanced data. Moreover, some graphical measures such as Receiver operating characteristics (ROC), Precision-Recall, and Detection error trade-off (DET) curves are presented with details. Additionally, in a step-by-step approach, different numerical examples are demonstrated to explain the preprocessing steps of plotting ROC, PR, and DET curves.

Keywords

Computer sciencePreprocessorConfusion matrixMetric (unit)Binary classificationData miningRobustness (evolution)Binary numberMachine learningField (mathematics)Class (philosophy)Artificial intelligenceAlgorithmSupport vector machineMathematics

Affiliated Institutions

Related Publications

Publication Info

Year
2018
Type
article
Volume
17
Issue
1
Pages
168-192
Citations
2157
Access
Closed

Social Impact

Altmetric

Social media, news, blog, policy document mentions

Citation Metrics

2157
OpenAlex
107
Influential
1482
CrossRef

Cite This

Alaa Tharwat (2018). Classification assessment methods. Applied Computing and Informatics , 17 (1) , 168-192. https://doi.org/10.1016/j.aci.2018.08.003

Identifiers

DOI
10.1016/j.aci.2018.08.003

Data Quality

Data completeness: 81%