FragmentFlow: Scalable Transition State Generation for Large Molecules

Published: 02 Mar 2026, Last Modified: 02 Mar 2026AI4Mat-ICLR-2026 PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Flow matching, transition state, distribution shift, chemical reactions, generative modeling
TL;DR: FragmentFlow predicts chemical reaction transition states by modeling only the reactive core atoms, achieving 90% accuracy on large molecules with 30% less computation than traditional methods.
Abstract: Transition states (TSs) are central to understanding and quantitatively predicting chemical reactivity and reaction mechanisms. Although traditional TS generation methods are computationally expensive, recent generative modeling approaches have enabled chemically meaningful TS prediction for relatively small molecules. However, these methods fail to generalize to practically relevant reaction substrates because of distribution shifts induced by increasing molecular sizes. Furthermore, TS geometries for larger molecules are not available at scale, making it infeasible to train generative models from scratch on such molecules. To address these challenges, we introduce FragmentFlow: a divide-and-conquer approach that trains a generative model to predict TS geometries for the reactive core atoms, which define the reaction mechanism. The full TS structure is then reconstructed by re-attaching substituent fragments to the predicted core. By operating on reactive cores, whose size and composition remain relatively invariant across molecular contexts, FragmentFlow mitigates distribution shifts in generative modeling. Evaluated on a new curated dataset of reactions involving reactants with up to 33 heavy atoms, FragmentFlow correctly identifies 90% of TSs while requiring 30% fewer saddle-point optimization steps than classical initialization schemes. These results point toward scalable TS generation for high-throughput reactivity studies.
Submission Track: Full Paper
Submission Category: AI-Guided Design
Submission Number: 42
Loading