Is Active Learning Green? An Empirical Study

Shirin Salehi, Anke Schmeink

Published: 01 Jan 2023, Last Modified: 06 Nov 2025IEEE Big Data 2023EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Active learning (AL) is a machine learning (ML) approach that entails carefully choosing the most informative samples for annotation during training, aiming to minimize annotation costs. AL has recently emerged as a promising approach in the context of green ML, as an energy-efficient learning method on top of being data-efficient. Nevertheless, given the significant cost of AL, it might lead one to question the effectiveness of this approach in reducing computational costs and promoting green ML. In this paper, we conduct a comparative analysis of both fundamental and advanced active learning methods against a random baseline selection, aiming to demonstrate the efficacy of active learning to reduce the cost of training. This study demonstrates that, with careful tuning of hyperparameters like query size and pool size, AL is able to reduce runtime while maintaining competitive accuracy for classification tasks.

External IDs:dblp:conf/bigdataconf/SalehiS23