VICE: Variational Interpretable Concept Embeddings

Lukas Muttenthaler; Charles Yang Zheng; Patrick McClure; Robert A. Vandermeulen; Martin N Hebart; Francisco Pereira

VICE: Variational Interpretable Concept Embeddings

Lukas Muttenthaler, Charles Yang Zheng, Patrick McClure, Robert A. Vandermeulen, Martin N Hebart, Francisco Pereira

Published: 31 Oct 2022, Last Modified: 06 Apr 2025NeurIPS 2022 AcceptReaders: Everyone

Keywords: Cognitive Science, Latent Variable Models, Variational Inference, Concept Representations

TL;DR: This paper introduces Variational Interpretable Concept Embeddings, an approximate Bayesian method for learning interpretable object concept embeddings from human behavior in an odd-one-out task.

Abstract: A central goal in the cognitive sciences is the development of numerical models for mental representations of object concepts. This paper introduces Variational Interpretable Concept Embeddings (VICE), an approximate Bayesian method for embedding object concepts in a vector space using data collected from humans in a triplet odd-one-out task. VICE uses variational inference to obtain sparse, non-negative representations of object concepts with uncertainty estimates for the embedding values. These estimates are used to automatically select the dimensions that best explain the data. We derive a PAC learning bound for VICE that can be used to estimate generalization performance or determine a sufficient sample size for experimental design. VICE rivals or outperforms its predecessor, SPoSE, at predicting human behavior in the triplet odd-one-out task. Furthermore, VICE's object representations are more reproducible and consistent across random initializations, highlighting the unique advantage of using VICE for deriving interpretable embeddings from human behavior.

Supplementary Material: pdf

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/vice-variational-interpretable-concept/code)

12 Replies

Loading