The Effect of Class Imbalance on Precision-Recall Curves

In this note, I study how the precision of a binary classifier depends on the ratio of positive to negative cases in the test set, as well as the classifier's true and false-positive rates. This relationship allows prediction of how the precision-recall curve will change with , which seems not...

Full description

Saved in:
Bibliographic Details
Published inNeural computation Vol. 33; no. 4; pp. 853 - 857
Main Author Williams, Christopher K. I.
Format Journal Article
LanguageEnglish
Published One Rogers Street, Cambridge, MA 02142-1209, USA MIT Press 01.04.2021
MIT Press Journals, The
Online AccessGet full text

Cover

Loading…
More Information
Summary:In this note, I study how the precision of a binary classifier depends on the ratio of positive to negative cases in the test set, as well as the classifier's true and false-positive rates. This relationship allows prediction of how the precision-recall curve will change with , which seems not to be well known. It also allows prediction of how and the precision gain and recall gain measures of Flach and Kull (2015) vary with .
Bibliography:April, 2021
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0899-7667
1530-888X
DOI:10.1162/neco_a_01362