Uncertainty-Aware Discrete Diffusion Improves Protein Design

ICML 2025 Workshop FM4LS Submission58 Authors

Published: 12 Jul 2025, Last Modified: 12 Jul 2025FM4LS 2025EveryoneRevisionsBibTeXCC BY 4.0
Keywords: Protein Inverse Folding, Protein Desing, Discrete Diffusion Models, Uncertainty-Award Guidance
TL;DR: We propose an uncertainty-aware discrete diffusion model for protein inverse folding that significantly improves performance by guiding denoising with learned prior-posterior uncertainty signals.
Abstract: Protein inverse folding involves generating amino acid sequences that adopt a specified 3D structure---a key challenge in structural biology and molecular engineering. While discrete diffusion models have demonstrated strong performance, existing methods often apply uniform denoising across residues, overlooking position-specific uncertainty. We propose an uncertainty-aware discrete denoising diffusion model that employs a prior-posterior signaling mechanism to dynamically guide the denoising process. Our approach further integrates learned priors from a pretrained protein large language model and a structure encoder within a modular framework, jointly optimized through multi-objective training. Across multiple benchmarks, our method achieves substantial improvements over state-of-the-art baselines, offering a principled framework for structure-conditioned sequence generation in proteins and beyond.
Submission Number: 58
Loading