Margin-Based Active Learning of Classifiers

Published: 31 Mar 2024, Last Modified: 01 Oct 2024OpenReview Archive Direct UploadEveryoneCC0 1.0
Abstract: We study active learning of multiclass classifiers, focusing on the realizable transductive setting. The input is a finite subset of some metric space, and the concept to be learned is a partition of into classes. The goal is to learn by querying the labels of as few elements of as possible. This is a useful subroutine in pool-based active learning, and is motivated by applications where labels are expensive to obtain. Our main result is that, in very different settings, there exist interesting notions of margin that yield efficient active learning algorithms. First, we consider the case , assuming that each class has an unknown" personalized" margin separating it from the rest. Second, we consider the case where is a finite metric space, and the classes are convex with margin according to the geodesic distances in the thresholded connectivity graph. In both cases, we give algorithms that learn exactly, in polynomial time, using label queries, where hides a near-optimal dependence on the dimension of the metric spaces. Our results actually hold for or can be adapted to more general settings, such as pseudometric and semimetric spaces.
Loading