Finding planted cliques using gradient descent

Aukosh Jagannath, Reza Gheissari, Yiming Xu

Published: 15 May 2025, Last Modified: 02 May 2026; SIAM Journal on Mathematics of Data Science; Everyone; arXiv.org perpetual, non-exclusive license

Abstract: The planted clique problem is a paradigmatic model of statistical-to-computational gaps: the planted clique is information-theoretically detectable if its size $k \geq 2\log_2 n$ but polynomial-time algorithms only exist for the recovery task when $k = \Omega(\sqrt{n})$. By now, there are many algorithms that succeed as soon as $k = \Omega(\sqrt{n})$. Glaringly, however, no black-box optimization method, e.g., gradient descent or the Metropolis process, has been shown to work. In fact, Chen, Mossel, and Zadik recently showed that any Metropolis process whose state space is the set of cliques fails to find any sub-linear sized planted clique in polynomial time if initialized naturally from the empty set. We show that using the method of Lagrange multipliers, namely optimizing the Hamiltonian given by the sum of the objective function and the clique constraint over the space of all subgraphs, succeeds. In particular, we prove that Markov chains which minimize this Hamiltonian (gradient descent and a low-temperature relaxation of it) succeed at recovering planted cliques of size $k = \Omega(\sqrt{n})$ if initialized from the full graph. Importantly, initialized from the empty set, the relaxation still does not help the gradient descent find sub-linear planted cliques. We also demonstrate robustness of these Markov chain approaches under a natural contamination model.
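
To make the Lagrange-multiplier idea concrete, the following is a minimal illustrative sketch and not the authors' implementation. The abstract does not state the exact Hamiltonian, so the form used here, H(S) = -|S| + gamma * (number of non-adjacent pairs inside S), is an assumption, as are the multiplier value gamma = 2, the steepest single-vertex-flip update rule, and all function names. The sketch starts from the full vertex set, as the abstract describes, and flips one vertex at a time while H strictly decreases.

```python
import random


def planted_clique_graph(n, k, seed=0):
    """Sample G(n, 1/2) and plant a clique on k random vertices (hypothetical helper)."""
    rng = random.Random(seed)
    clique = set(rng.sample(range(n), k))
    adj = [[False] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if (i in clique and j in clique) or rng.random() < 0.5:
                adj[i][j] = adj[j][i] = True
    return adj, clique


def hamiltonian(adj, subset, gamma):
    """Assumed Hamiltonian: -|S| + gamma * (# non-adjacent pairs inside S)."""
    members = sorted(subset)
    violations = sum(
        1
        for a, i in enumerate(members)
        for j in members[a + 1:]
        if not adj[i][j]
    )
    return -len(members) + gamma * violations


def descend(adj, gamma=2.0):
    """Steepest single-vertex-flip descent on the assumed Hamiltonian,
    initialized from the full vertex set."""
    n = len(adj)
    in_set = [True] * n
    # missing[v] = number of vertices currently in S (other than v) not adjacent to v
    missing = [
        sum(1 for u in range(n) if u != v and not adj[v][u]) for v in range(n)
    ]

    def delta(v):
        # Change in H caused by flipping the membership of v.
        if in_set[v]:
            return 1 - gamma * missing[v]   # removing v
        return gamma * missing[v] - 1       # adding v

    while True:
        v = min(range(n), key=delta)
        if delta(v) >= 0:
            break  # no single flip lowers H: local minimum reached
        in_set[v] = not in_set[v]
        step = 1 if in_set[v] else -1
        for u in range(n):
            if u != v and not adj[v][u]:
                missing[u] += step
    return {v for v in range(n) if in_set[v]}


if __name__ == "__main__":
    adj, clique = planted_clique_graph(60, 20, seed=1)
    found = descend(adj)
    print("size of set found:", len(found))
    print("overlap with planted clique:", len(found & clique))
```

With gamma > 1 under this assumed Hamiltonian, removing any vertex that has a missing edge inside the current set lowers H, and adding a vertex adjacent to the whole set also lowers H, so every local minimum of the sketch is a maximal clique. Whether that clique coincides with the planted one depends on n, k, and the dynamics; the guarantees stated in the abstract apply to the chains analyzed in the paper, not to this toy version.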