Performance Measures for Binary Classification

Berrar, Daniel (2019). Performance Measures for Binary Classification. In: Ranganathan, Shoba; Gribskov, Michael; Nakai, Kenta; Schönbach, Christian and Cannataro, Mario eds. Encyclopedia of Bioinformatics and Computational Biology. Reference Module in Life Sciences, 1. Elsevier, pp. 546–560.



This article is an introduction to some of the most commonly used performance measures for the evaluation of binary classifiers. These measures are categorized into three broad families: measures based on a single classification threshold, measures based on a probabilistic interpretation of error, and ranking measures. Graphical methods, such as ROC curves, precision-recall curves, TPR-FPR plots, gain charts, and lift charts, are also discussed. Using a simple example, we illustrate how to calculate the various performance measures and show how they are related. The article also explains how to assess the statistical significance of an obtained performance value, how to calculate approximate and exact parametric confidence intervals, and how to derive percentile bootstrap confidence intervals for a performance measure.

Viewing alternatives


Public Attention

Altmetrics from Altmetric

Number of Citations

Citations from Dimensions
No digital document available to download for this item

Item Actions