Exploring Minimally Sufficient Representation in Active Learning through Label-Irrelevant Patch Augmentation

Published: 20 Nov 2023, Last Modified: 05 Dec 2023CPAL 2024 (Proceedings Track) OralEveryoneRevisionsBibTeX
Keywords: Active Learning, Data Augmentation, Minimally Sufficient Representation
Abstract: Deep learning models, which require abundant labeled data for training, are expensive and time-consuming to implement, particularly in medical imaging. Active learning (AL) aims to maximize model performance with few labeled samples by gradually expanding and labeling a new training set. In this work, we intend to learn a "good" feature representation that is both sufficient and minimal, facilitating effective AL for medical image classification. This work proposes an efficient AL framework based on off-the-shelf self-supervised learning models, complemented by a label-irrelevant patch augmentation scheme. This scheme is designed to reduce redundancy in the learned features and mitigate overfitting in the progress of AL. Our framework offers efficiency to AL in terms of parameters, samples, and computational costs. The benefits of this approach are extensively validated across various medical image classification tasks employing different AL strategies.~\footnote{Source Codes: \url{https://github.com/chrisyxue/DA4AL}}.
Track Confirmation: Yes, I am submitting to the proceeding track.
Submission Number: 8