Interpretable AI Reasoning for the Identification of Vibrational Spectroscopic Markers of Acetaminophen Impurities

Published: 04 Mar 2026, Last Modified: 26 Mar 2026ICLR 2026 Workshop LMRL PosterEveryoneRevisionsBibTeXCC BY 4.0
Confirmation: I have read and agree with the workshop's policy on behalf of myself and my co-authors.
Track: long paper (4–8 pages excluding references)
Keywords: artificial intelligence; paracetamol; infrared; vibrational spectroscopic; acetaminophen impurities.
TL;DR: AI can reliably use DFT-based vibrational spectra to explain and distinguish paracetamol from its main synthetic impurities for pharmaceutical quality control.
Abstract: This study evaluates the capability of the Gemini 3 artificial intelligence model to identify and distinguish the active pharmaceutical ingredient Paracetamol (PCA) from its main synthetic impurities, p-aminophenol (PAP) and \iupac{p-nitrophenol} (PNP), based on reasoning grounded in vibrational spectroscopic data. To this end, vibrational mode tables obtained from theoretical calculations using Density Functional Theory (DFT) at the M$06$-$2$X/$6$-$311++$G(d,p) level of theory were employed. The model analyzed diagnostic markers distributed across three distinct spectral regions ($200$ to $4000$ $cm^{-1}$), correlating specific structural variations with their corresponding vibrational signatures. The results demonstrate that Gemini 3 can associate topological and functional differences among molecules with characteristic spectroscopic patterns, yielding interpretable and consistent molecular representations. Consequently, this study highlights the potential of artificial intelligence models as auxiliary tools for automated pharmaceutical quality control, contributing to the reliable identification of impurities in synthetic drug pathways.
Anonymization: This submission has been anonymized for double-blind review via the removal of identifying information such as names, affiliations, and identifying URLs.
Submission Number: 37
Loading