Mixture Proportion Estimation and Weakly-supervised Kernel Test for Conditional Independence

Published: 03 Feb 2026, Last Modified: 03 Feb 2026AISTATS 2026 PosterEveryoneRevisionsBibTeXCC BY 4.0
TL;DR: This paper presents novel identifiability results for mixture proportion estimation with conditional independence assumption and kernel tests to validate the assumption.
Abstract: Mixture proportion estimation (MPE) aims to estimate class priors from unlabeled data. This task is a critical component in weakly supervised learning such as PU learning, learning with label noise, and domain adaptation. Existing MPE methods rely on the _irreducibility_ assumption or its variant for identifiability. In this paper, we propose novel assumptions based on conditional independence (CI) given the class label, which ensure identifiability even when irreducibility does not hold. We develop method of moments estimators under these assumptions and analyze their asymptotic properties. Furthermore, we present weakly-supervised kernel tests to validate the CI assumptions, which are of independent interest in applications such as causal discovery and fairness evaluation. Empirically, we demonstrate the improved performance of our estimators compared with existing methods and that our tests successfully control both type I and type II errors.
Submission Number: 313
Loading