DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking

Gabriele Corso; Hannes Stärk; Bowen Jing; Regina Barzilay; Tommi S. Jaakkola

DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking

Gabriele Corso, Hannes Stärk, Bowen Jing, Regina Barzilay, Tommi S. Jaakkola

Published: 29 Nov 2022, Last Modified: 15 Feb 2026SBM 2022 OralReaders: Everyone

Keywords: molecular docking, protein-ligand binding, diffusion models, score-based models, molecular structure, equivariance, geometric deep learning

TL;DR: Molecular docking via non-Euclidean diffusion modeling and confidence estimation

Abstract: Predicting the binding structure of a small molecule ligand to a protein---a task known as molecular docking---is critical to drug design. Recent deep learning methods that treat docking as a regression problem have decreased runtime compared to traditional search-based methods but have yet to offer substantial improvements in accuracy. We instead frame molecular docking as a \emph{generative} modeling problem and develop DiffDock, a diffusion generative model over the non-Euclidean manifold of ligand poses. To do so, we map this manifold to the product space of the degrees of freedom (translational, rotational, and torsional) involved in docking and develop an efficient diffusion process on this space. Empirically, DiffDock obtains a 38% top-1 success rate (RMSD < 2Å) on PDBBind, significantly outperforming the previous state-of-the-art of traditional docking (23%) and deep learning (20%) methods. Moreover, DiffDock has fast inference times and provides confidence estimates with high selective accuracy.

Student Paper: Yes

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 5 code implementations](https://www.catalyzex.com/paper/diffdock-diffusion-steps-twists-and-turns-for/code)

1 Reply

Loading