Mini-Batch Optimization of Contrastive Loss

Published: 04 Mar 2023, Last Modified: 17 Sept 2023. ME-FoMo 2023 Poster.
Keywords: contrastive learning, mini-batch optimization, batch selection
TL;DR: We provide a theoretical analysis of mini-batch optimization of contrastive loss and propose new principled batch selection algorithms.
Abstract: In this paper, we study the effect of mini-batch selection on contrastive loss and propose new mini-batch selection methods to improve efficiency. Theoretically, we show that the full-batch and mini-batch settings share the same solution, the simplex Equiangular Tight Frame (ETF), provided that all $\binom{N}{B}$ mini-batches are seen during training. When not all possible batches are seen, however, mini-batch training can converge to suboptimal solutions. To address this issue, we propose efficient mini-batch selection methods. Our experiments demonstrate that the proposed methods reach a near-optimal solution in fewer gradient steps than existing mini-batch selection methods.
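As a concrete illustration of the quantity the abstract discusses, the sketch below compares the contrastive loss on the full batch of $N$ pairs against the average loss over all $\binom{N}{B}$ mini-batches of size $B$. The specific loss used (a symmetric InfoNCE-style loss on normalized embeddings) and all variable names are assumptions for illustration only, not the paper's exact formulation or algorithm.

```python
import itertools
import numpy as np

def contrastive_loss(U, V, temperature=1.0):
    """Symmetric InfoNCE-style contrastive loss for a batch of embedding pairs.

    U, V: (B, d) arrays of L2-normalized embeddings; row i of U and row i of V
    form a positive pair, and all other rows serve as negatives.
    """
    logits = U @ V.T / temperature           # (B, B) pairwise similarities
    idx = np.arange(len(U))                  # positives lie on the diagonal

    def xent(l):
        l = l - l.max(axis=1, keepdims=True)  # subtract row max for stability
        logp = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -logp[idx, idx].mean()         # mean NLL of the positive pairs

    # average the two directions (U -> V and V -> U)
    return 0.5 * (xent(logits) + xent(logits.T))

rng = np.random.default_rng(0)
N, B, d = 6, 3, 4                            # toy sizes: N pairs, batch size B
U = rng.normal(size=(N, d))
V = rng.normal(size=(N, d))
U /= np.linalg.norm(U, axis=1, keepdims=True)
V /= np.linalg.norm(V, axis=1, keepdims=True)

full = contrastive_loss(U, V)
# enumerate all C(N, B) mini-batches and average their losses
batches = list(itertools.combinations(range(N), B))
avg = np.mean([contrastive_loss(U[list(b)], V[list(b)]) for b in batches])
print(f"full-batch loss {full:.4f}; mean over {len(batches)} mini-batches {avg:.4f}")
```

Enumerating all $\binom{N}{B}$ batches is only feasible at toy scale; the point of principled batch selection is precisely to avoid this exhaustive enumeration while still steering training toward the full-batch optimum.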
Community Implementations: 2 code implementations (CatalyzeX)