Trustworthy Retrosynthesis: Mitigating Hallucinations with Reaction Plausibility Filtering and Retrieval-Augmented Scoring

Michał Sadowski; Maria Wyrzykowska; Lukasz Sztukiewicz; Tadija Radusinović; Jan Rzymkowski; Paweł Włodarczyk-Pruszyński; Mikołaj Sacha; Piotr Kozakowski; Ruard van Workum; Stanislaw Kamil Jastrzebski

Trustworthy Retrosynthesis: Mitigating Hallucinations with Reaction Plausibility Filtering and Retrieval-Augmented Scoring

Michał Sadowski, Maria Wyrzykowska, Lukasz Sztukiewicz, Tadija Radusinović, Jan Rzymkowski, Paweł Włodarczyk-Pruszyński, Mikołaj Sacha, Piotr Kozakowski, Ruard van Workum, Stanislaw Kamil Jastrzebski

Published: 24 Sept 2025, Last Modified: 15 Oct 2025NeurIPS2025-AI4Science PosterEveryoneRevisionsBibTeXCC BY 4.0

Track: Track 1: Original Research/Position/Education/Attention Track

Keywords: Trustworthy ML, Reliable AI, Robustness, Computer-Aided Synthesis Planning (CASP), Retrosynthesis, Reaction Feasibility, Graph Neural Networks, Transformer Models

TL;DR: To fix the problem of AI hallucinating fake chemical reactions, we developed a robust three-part framework based on ML models that filters out implausible predictions, which was a key part of our $1 million prize-winning solution.

Abstract: While Artificial Intelligence (AI) has lead to significant improvements in Computer-Aided Synthesis Planning (CASP), its credibility within the chemical community is fragile. AI retrosynthesis models frequently “hallucinate” chemically implausible reactions, which undermines trust. To address this, we propose a framework that integrates three orthogonal validation strategies to ensure reaction plausibility. This key insight, combining reaction validation strategies which cover different error patterns, was the basis of our winning solution to the \textit{[Anonymized for the sake of blind review]} Retrosynthesis Challange. Our approach combines: (1) a novel Transformer-based model, called Reaction Prior, that estimates reaction likelihood from large-scale experimental data, mimicking chemical reasoning (2) a Graph Neural Network trained on a reaction dataset augmented with synthetically generated incorrect reactions, and (3) a retrieval-based scoring system that leverages chemical databases and grounds suggestions in known chemical literature. The framework was validated on unseen targets through a novel human evaluation process, successfully rejecting the most hallucinated reactions. In this evaluation, chemical experts manually reviewed reactions within the synthetic paths, providing a more reliable and trustworthy form of verification compared to purely automatic methods.

Submission Number: 172

Loading