Unifying Self-Supervised Clustering and Energy-Based Models

Emanuele Sansone; Robin Manhaeve

Published: 15 Aug 2025, Last Modified: 15 Aug 2025. Accepted by TMLR. License: CC BY 4.0

Abstract: Self-supervised learning excels at learning representations from large amounts of data. At the same time, generative models offer the complementary property of learning information about the underlying data generation process. In this study, we aim to establish a principled connection between these two paradigms and to highlight the benefits of their complementarity. In particular, we analyze self-supervised learning objectives, elucidating the underlying probabilistic graphical models and presenting a standardized methodology for deriving them from first principles. The analysis suggests a natural means of integrating self-supervised learning with likelihood-based generative models. We instantiate this concept within the realm of cluster-based self-supervised learning and energy-based models, introducing a lower bound that is proven to reliably penalize the most important failure modes and that unlocks full unification. Our theoretical findings are substantiated through experiments on synthetic and real-world data, including SVHN, CIFAR10, and CIFAR100, demonstrating that our objective function allows a backbone network to be trained jointly in a discriminative and generative fashion, consequently outperforming existing self-supervised learning strategies in terms of clustering, generation, and out-of-distribution detection performance by a wide margin. We also demonstrate that the solution can be integrated into a neuro-symbolic framework to tackle a simple yet non-trivial instantiation of the symbol grounding problem.

Submission Length: Long submission (more than 12 pages of main content)

Changes Since Last Submission: Dear Action Editor, we would like to thank you and the reviewers for your work and for the feedback received during the review process. We have decided to emphasize in the introduction the three key challenges investigated in the paper (formulation, integration, and unification) in order to provide an overarching structure for the technical content of the paper and to enhance its clarity. The code is also now publicly available. Thank you again for your service and support. Kind regards, The authors

Code: https://github.com/emsansone/GEDI.git

Assigned Action Editor: ~Ole_Winther1

Submission Number: 4431
