A bumpy journey: exploring deep Gaussian mixture models

Published: 09 Dec 2020, Last Modified: 05 May 2023, ICBINB 2020 Spotlight
Keywords: factor analysis, neural networks, deep Gaussian mixture models, model-based clustering
Abstract: The deep Gaussian mixture model (DGMM) is a framework directly inspired by the finite mixture of factor analysers (MFA) model and by deep learning architectures composed of multiple layers. The MFA is a generative model in which a data point arises from a latent variable (termed the score) that is sampled from a standard multivariate Gaussian distribution and then transformed linearly; the linear transformation matrix (termed the loading matrix) is specific to a component of the finite mixture. The DGMM stacks MFA layers: the latent scores are no longer assumed to be drawn from a standard Gaussian but from a mixture of factor analysers, so the scores at each layer serve both as the input of an MFA and as variables that themselves have latent scores. Only the latent scores of the DGMM's last layer are assumed to be drawn from a standard multivariate Gaussian distribution. In recent years the DGMM has gained prominence in the literature: intuitively, this model should capture distributions more precisely than a simple Gaussian mixture model. We show in this work that, while the DGMM is an original and novel idea, in certain cases inferring its parameters is challenging, and we give some insights into the probable reasons for this difficulty. Experimental results are provided on GitHub: https://github.com/ansubmissions/ICBINB, alongside an R package that implements the algorithm and a number of ready-to-run R scripts.
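To make the stacked generative process described above concrete, the following is a minimal sketch (in R, matching the accompanying package's language, but not the authors' implementation) of sampling a single observation from a two-layer DGMM. All dimensions, component counts, and parameter values are hypothetical and chosen only for illustration.

```r
# Minimal generative sketch of a two-layer DGMM: the deepest scores are
# standard Gaussian, and each layer applies a component-specific linear
# transformation (loading matrix) plus Gaussian noise.
set.seed(1)

sample_mfa_layer <- function(z, weights, mus, lambdas, sigma) {
  # Draw a mixture component k, then transform the incoming score:
  # output = mu_k + Lambda_k %*% z + noise
  k <- sample(length(weights), 1, prob = weights)
  mus[[k]] + lambdas[[k]] %*% z + rnorm(nrow(lambdas[[k]]), sd = sigma)
}

# Deepest layer: 2-dimensional score from a standard multivariate Gaussian.
z2 <- rnorm(2)

# Second layer: map the 2-dimensional score to a 3-dimensional latent score,
# using one of two components (hypothetical parameter values).
w2      <- c(0.4, 0.6)
mu2     <- list(rep(0, 3), rep(1, 3))
lambda2 <- list(matrix(rnorm(6), 3, 2), matrix(rnorm(6), 3, 2))
z1 <- sample_mfa_layer(z2, w2, mu2, lambda2, sigma = 0.1)

# First layer: map the 3-dimensional latent score to the observed
# 5-dimensional data point, again through a component-specific loading matrix.
w1      <- c(0.5, 0.5)
mu1     <- list(rep(0, 5), rep(-1, 5))
lambda1 <- list(matrix(rnorm(15), 5, 3), matrix(rnorm(15), 5, 3))
x <- sample_mfa_layer(z1, w1, mu1, lambda1, sigma = 0.1)
```

Collapsing the two layers yields a single Gaussian mixture with structured covariances, which is why inference must disentangle the contributions of the individual layers.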