Class-Weighted Evaluation Metrics for Imbalanced Data Classification

Akhilesh Gupta; Nesime Tatbul; Ryan Marcus; Shengtian Zhou; Insup Lee; Justin Gottschlich

Class-Weighted Evaluation Metrics for Imbalanced Data Classification

Akhilesh Gupta, Nesime Tatbul, Ryan Marcus, Shengtian Zhou, Insup Lee, Justin Gottschlich

28 Sept 2020 (modified: 05 May 2023)ICLR 2021 Conference Blind SubmissionReaders: Everyone

Keywords: Imbalanced data classification, Evaluation metrics, Log parsing, Sentiment analysis

Abstract: Class distribution skews in imbalanced datasets may lead to models with prediction bias towards majority classes, making fair assessment of classifiers a challenging task. Balanced Accuracy is a popular metric used to evaluate a classifier’s prediction performance under such scenarios. However, this metric falls short when classes vary in importance, especially when class importance is skewed differently from class cardinality distributions. In this paper, we propose a simple and general-purpose evaluation framework for imbalanced data classification that is sensitive to arbitrary skews in class cardinalities and importances. Experiments with several state-of-the-art classifiers tested on real-world datasets and benchmarks from two different domains show that our new framework is more effective than Balanced Accuracy –- not only in evaluating and ranking model predictions, but also in training the models themselves.

One-sentence Summary: We present an evaluation framework for imbalanced data classification that is sensitive to arbitrary skews in class cardinalities and importances.

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics

Supplementary Material: zip

Reviewed Version (pdf): https://openreview.net/references/pdf?id=3sWZpVAG8b

5 Replies

Loading