Reinforced active learning for image segmentation

Arantxa Casanova; Pedro O. Pinheiro; Negar Rostamzadeh; Christopher J. Pal

Reinforced active learning for image segmentation

Arantxa Casanova, Pedro O. Pinheiro, Negar Rostamzadeh, Christopher J. Pal

Published: 20 Dec 2019, Last Modified: 22 Jun 2025ICLR 2020 Conference Blind SubmissionReaders: Everyone

Keywords: semantic segmentation, active learning, reinforcement learning

TL;DR: Learning a labeling policy with reinforcement learning to reduce labeling effort for the task of semantic segmentation

Abstract: Learning-based approaches for semantic segmentation have two inherent challenges. First, acquiring pixel-wise labels is expensive and time-consuming. Second, realistic segmentation datasets are highly unbalanced: some categories are much more abundant than others, biasing the performance to the most represented ones. In this paper, we are interested in focusing human labelling effort on a small subset of a larger pool of data, minimizing this effort while maximizing performance of a segmentation model on a hold-out set. We present a new active learning strategy for semantic segmentation based on deep reinforcement learning (RL). An agent learns a policy to select a subset of small informative image regions -- opposed to entire images -- to be labeled, from a pool of unlabeled data. The region selection decision is made based on predictions and uncertainties of the segmentation model being trained. Our method proposes a new modification of the deep Q-network (DQN) formulation for active learning, adapting it to the large-scale nature of semantic segmentation problems. We test the proof of concept in CamVid and provide results in the large-scale dataset Cityscapes. On Cityscapes, our deep RL region-based DQN approach requires roughly 30% less additional labeled data than our most competitive baseline to reach the same performance. Moreover, we find that our method asks for more labels of under-represented categories compared to the baselines, improving their performance and helping to mitigate class imbalance.

Code: [![github](/images/github_icon.svg) ArantxaCasanova/ralis](https://github.com/ArantxaCasanova/ralis)

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/reinforced-active-learning-for-image/code)

Original Pdf: pdf

9 Replies

Loading