Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding

Published: 11 Jun 2025, Last Modified: 18 Jul 2025GenBio 2025 PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Generative AI, Deep Learning, Drug Discovery
Abstract: The role of Artificial Intelligence (AI) is growing in every stage of drug development. Nevertheless, a major challenge in drug discovery AI remains: Drug pharmacokinetic (PK) datasets collected in different studies often exhibit limited overlap, creating data overlap sparsity. Thus, data curation becomes difficult, negatively impacting downstream research investigations in high-throughput screening, poly-pharmacy, and drug combination. We propose Imagand, a novel SMILES-to-Pharmacokinetic (S2PK) diffusion model capable of generating an array of PK target properties conditioned on SMILES inputs that exhibit data overlap sparsity. We show that Imagand-generated synthetic PK data closely resembles real data univariate and bivariate distributions, and can adequately fill in gaps among PK datasets. As such, Imagand is a promising solution for data overlap sparsity and may improve performance for downstream drug discovery research tasks.
Submission Number: 9
Loading