Learning from Label Proportions by Learning with Label Noise

Jianxin Zhang; Yutong Wang; Clayton Scott

Published: 31 Oct 2022, Last Modified: 06 Apr 2025, NeurIPS 2022 Accept, Readers: Everyone

Keywords: Machine Learning, Semi-supervised Learning, Learning Theory, Learning from Label Proportions, Learning from Label Noise

TL;DR: A theoretically grounded approach to learning from label proportions that achieves state-of-the-art performance.

Abstract: Learning from label proportions (LLP) is a weakly supervised classification problem where data points are grouped into bags, and the label proportions within each bag are observed instead of the instance-level labels. The task is to learn a classifier to predict the labels of future individual instances. Prior work on LLP for multi-class data has yet to develop a theoretically grounded algorithm. In this work, we propose an approach to LLP based on a reduction to learning with label noise, using the forward correction (FC) loss of Patrini et al. (2017). We establish an excess risk bound and a generalization error analysis for our approach, and we extend the theory of the FC loss, which may be of independent interest. Our approach demonstrates improved empirical performance in deep learning scenarios across multiple datasets and architectures, compared to the leading methods.
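
For readers unfamiliar with forward correction, the following is a minimal sketch of the FC loss in its generic label-noise form, not the authors' implementation. It assumes a known row-stochastic transition matrix `T` with `T[i, j] = P(noisy label = j | clean label = i)`; the function names, the use of NumPy, and the clipping constant are illustrative choices. The abstract does not say how the noise model is built from bag-level label proportions, so that step is left out.

```python
import numpy as np


def softmax(logits):
    """Row-wise softmax, shifted by the row max for numerical stability."""
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def forward_corrected_loss(logits, noisy_labels, T):
    """Mean cross-entropy between noisy labels and forward-corrected predictions.

    logits:       (n, C) array of scores for the clean classes
    noisy_labels: (n,) integer array of observed (noisy) labels
    T:            (C, K) assumed transition matrix, T[i, j] = P(noisy=j | clean=i)
    """
    clean_probs = softmax(logits)
    # Push the clean-class posterior through the noise model:
    # P(noisy=j | x) = sum_i P(clean=i | x) * T[i, j]
    noisy_probs = clean_probs @ T
    picked = noisy_probs[np.arange(len(noisy_labels)), noisy_labels]
    # Clip to avoid log(0) when T contains zeros (illustrative constant).
    return float(-np.log(np.clip(picked, 1e-12, None)).mean())
```

When `T` is the identity matrix, this reduces to the ordinary softmax cross-entropy, which is a convenient sanity check.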

Supplementary Material: zip

Community Implementations: [4 code implementations](https://www.catalyzex.com/paper/learning-from-label-proportions-by-learning/code)
