Keywords: VQ-VAE, Tractable Probabilistic Models, Model Distillation
TL;DR: VQ-VAEs, though typically seen as intractable, can be distilled into tractable probabilistic circuits by focusing on high-probability latent codes, enabling efficient inference without sacrificing expressiveness.
Abstract: Deep generative models with discrete latent spaces, such as the Vector-Quantized Variational Autoencoder (VQ-VAE), offer excellent data generation capabilities, but the large size of their latent space renders their probabilistic inference intractable. We demonstrate that the VQ-VAE can be distilled into a tractable model by selecting a subset of high-probability latent variables. This simple strategy is especially efficient when the VQ-VAE underutilizes its latent space, which is very often the case. We frame the distilled model as a probabilistic circuit and show that it preserves the expressiveness of the VQ-VAE while providing tractable probabilistic inference. Experiments illustrate competitive performance in density estimation and conditional generation tasks, challenging the view of the VQ-VAE as an inherently intractable model.
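To make the selection step concrete, below is a minimal, hypothetical sketch (not the authors' released code) of how high-probability latent codes might be identified: estimate the empirical code-usage distribution of a trained VQ-VAE over a dataset and keep the smallest set of codebook entries covering a target probability mass. The function name, the `mass` threshold, and the toy data are illustrative assumptions; the distilled probabilistic circuit would then be built over the retained codes.

```python
# Hypothetical illustration: a trained VQ-VAE often uses only a small subset
# of its codebook. This sketch estimates code-usage probabilities from encoded
# data and keeps the high-probability codes over which a tractable distilled
# model (e.g., a probabilistic circuit) could then be defined.
import numpy as np

def select_high_probability_codes(code_indices, codebook_size, mass=0.99):
    """Return the codebook indices that jointly cover `mass` of the empirical
    code-usage distribution, plus their renormalized probabilities."""
    counts = np.bincount(code_indices.ravel(), minlength=codebook_size)
    probs = counts / counts.sum()
    order = np.argsort(probs)[::-1]              # most frequent codes first
    cumulative = np.cumsum(probs[order])
    keep = order[: np.searchsorted(cumulative, mass) + 1]
    return keep, probs[keep] / probs[keep].sum()

# Toy usage: 10 codes, but the encoder effectively uses only a few of them.
rng = np.random.default_rng(0)
codes = rng.choice(10, size=10_000,
                   p=[.4, .3, .15, .1, .03, .01, .005, .003, .001, .001])
kept, kept_probs = select_high_probability_codes(codes, codebook_size=10)
print(kept, kept_probs)  # a small subset of codes carries almost all the mass
```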
Submission Number: 25