SEL-BALD: Deep Bayesian Active Learning with Selective Labels

Ruijiang Gao; Mingzhang Yin; Maytal Saar-Tsechansky

Published: 25 Sept 2024; Last Modified: 14 Jan 2025; Venue: NeurIPS 2024 poster; License: CC BY 4.0

Keywords: Bayesian Active Learning with Disagreement; Selective Labels

TL;DR: We propose novel methods for active learning under the selective labels problem.

Abstract: Machine learning systems are widely used in high-stakes contexts in which experimental designs for assigning treatments are infeasible. When evaluating decisions is costly, as in investigating fraud cases or evaluating biopsy decisions, a sample-efficient strategy is needed. Existing active learning methods assume that humans will always label the instances selected by the machine learning model. In many critical applications, however, humans may decline to label selected instances for reasons such as regulatory constraints, domain knowledge, or algorithm aversion, so these methods are no longer sample efficient. In this paper, we study the Active Learning with Instance Rejection (ALIR) problem, which accounts for human discretion behavior in high-stakes decision-making problems. We propose new active learning algorithms under deep Bayesian active learning for selective labeling (SEL-BALD) to address the ALIR problem. Our algorithms consider how to acquire information for both the machine learning model and the human discretion model. We conduct experiments on both synthetic and real-world datasets to demonstrate the effectiveness of our proposed algorithms.
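To make the setting concrete, the following is a minimal sketch of the standard BALD score (the mutual information between the label and the model parameters, estimated from posterior samples) for binary labels, together with one naive way to account for instance rejection. The abstract does not specify the SEL-BALD acquisition functions, so the acceptance weighting here is purely an illustrative assumption, and the names `bald_score`, `acceptance_weighted_score`, `label_probs`, and `accept_probs` are hypothetical.

```python
import numpy as np


def binary_entropy(p, eps=1e-12):
    """Entropy (in nats) of a Bernoulli variable with success probability p."""
    p = np.clip(p, eps, 1.0 - eps)
    return -(p * np.log(p) + (1.0 - p) * np.log(1.0 - p))


def bald_score(probs):
    """Standard BALD score for binary labels.

    probs has shape (n_posterior_samples, n_candidates); entry [s, i] is
    P(y = 1 | x_i, theta_s) for the s-th posterior sample of the model.
    Returns entropy of the mean prediction minus the mean per-sample entropy.
    """
    probs = np.asarray(probs, dtype=float)
    predictive_entropy = binary_entropy(probs.mean(axis=0))
    expected_entropy = binary_entropy(probs).mean(axis=0)
    return predictive_entropy - expected_entropy


def acceptance_weighted_score(label_probs, accept_probs):
    """Hypothetical heuristic (an assumption, not the paper's method):
    discount each candidate's BALD score by the estimated probability
    that the human agrees to label it."""
    accept_mean = np.asarray(accept_probs, dtype=float).mean(axis=0)
    return accept_mean * bald_score(label_probs)


# Toy data: 2 posterior samples, 3 candidate instances.
label_probs = np.array([[0.1, 0.5, 0.9],
                        [0.9, 0.5, 0.9]])
accept_probs = np.array([[0.2, 0.9, 0.9],
                         [0.2, 0.9, 0.9]])

bald = bald_score(label_probs)
scores = acceptance_weighted_score(label_probs, accept_probs)
best = int(np.argmax(scores))
```

In this toy case only the first candidate has posterior samples that disagree, so it is the only one with a positive BALD score, even though its estimated acceptance probability is low. A weighting of this kind exploits the current estimate of human discretion but does not by itself acquire information about the discretion model, which the abstract names as a second objective.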

Primary Area: Active learning

Submission Number: 12433
