Mutual Information Estimation via $f$-Divergence and Data Derangements

Published: 25 Sept 2024 · Last Modified: 06 Nov 2024 · NeurIPS 2024 poster · CC BY 4.0
Keywords: mutual information, variational divergence, f-divergence, neural estimators, permutation, derangement
TL;DR: A new method for estimating mutual information exploiting the variational representation of the $f$-divergence and a derangement training strategy
Abstract: Estimating mutual information accurately is pivotal across diverse applications, from machine learning to communications and biology, where it provides insight into the inner workings of complex systems. Yet high-dimensional data pose a formidable challenge, owing to their size and the intricate relationships they contain. Recently proposed neural methods that employ variational lower bounds on the mutual information have gained prominence. However, these approaches suffer from either high bias or high variance, as the sample size and the structure of the loss function directly influence the training process. In this paper, we propose a novel class of discriminative mutual information estimators based on the variational representation of the $f$-divergence. We investigate the impact of the permutation function used to obtain the marginal training samples and present a novel architectural solution based on derangements, i.e., permutations with no fixed points. The resulting estimator is flexible and exhibits an excellent bias/variance trade-off. Extensive comparisons with state-of-the-art neural estimators on established reference scenarios show that our approach offers higher accuracy and lower complexity.
Supplementary Material: zip
Primary Area: Probabilistic methods (for example: variational inference, Gaussian processes)
Submission Number: 12186
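
For readers unfamiliar with the two ingredients summarized in the abstract, here is a minimal, hypothetical sketch (PyTorch; not the authors' code). It pairs the variational lower bound on the $f$-divergence for the KL case, $I(X;Y) \ge \mathbb{E}_{p(x,y)}[T] - \mathbb{E}_{p(x)p(y)}[e^{T-1}]$ (the Nguyen-Wainwright-Jordan representation), with marginal samples obtained by deranging the batch, i.e., permuting the $y$'s so that no sample stays paired with its own $x$. The critic interface, helper names, and training step are all assumptions for illustration, not the paper's implementation.

```python
import torch
import torch.nn as nn

def derangement(n: int) -> torch.Tensor:
    """Rejection-sample a permutation of {0, ..., n-1} with no fixed points,
    so no y_i stays paired with its own x_i. Assumes n > 1."""
    while True:
        perm = torch.randperm(n)
        if bool((perm != torch.arange(n)).all()):
            return perm

def nwj_kl_lower_bound(t_joint: torch.Tensor, t_marg: torch.Tensor) -> torch.Tensor:
    """NWJ variational lower bound on KL(p(x,y) || p(x)p(y)) = I(X;Y):
    E_p[T] - E_q[exp(T - 1)], with q-samples built via the derangement."""
    return t_joint.mean() - torch.exp(t_marg - 1.0).mean()

def mi_training_step(critic: nn.Module, x: torch.Tensor, y: torch.Tensor,
                     opt: torch.optim.Optimizer) -> float:
    """One gradient step; `critic` is any network mapping (x, y) to one
    scalar score per sample (hypothetical interface)."""
    t_joint = critic(x, y)                       # pairs drawn from p(x, y)
    t_marg = critic(x, y[derangement(len(y))])   # proxy pairs from p(x)p(y)
    loss = -nwj_kl_lower_bound(t_joint, t_marg)  # maximize the bound
    opt.zero_grad()
    loss.backward()
    opt.step()
    return -loss.item()  # current MI lower-bound estimate, in nats
```

Swapping the conjugate term $e^{T-1}$ for the convex conjugate $f^*$ of a different generator yields other members of the $f$-divergence class. The derangement matters because a plain random permutation can map an index to itself, accidentally feeding a joint pair into the marginal term and biasing the estimate.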