Improving Fragment-Based Deep Molecular Generative Models

Published: 17 Jun 2024, Last Modified: 16 Jul 2024ML4LMS OralEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Deep generative models, molecules, VAE, SMILES, fragment-based
TL;DR: Novel Molecule fragmentation algorithm; works great for use in VAEs
Abstract: Deep molecular generative models have shown promising results and paved a new way for drug discovery. Their ability to explore the molecular space, estimated to be 1060, is significantly greater than traditional methods used for the virtual screening of existing databases. We introduce a novel fragmentation algorithm particularly suitable for use in deep generative models. In contrast to existing fragmentation algorithms, our procedure sequentially breaks a molecule along BRIC bonds in such a manner that the linearization of fragments is directly invertible, guaranteed to be able to reconstruct the original molecule from the fragment sequence. This makes it appropriate for use in deep generative models trained with sequential models as likelihoods. We compare with previous fragment-based SMILES VAE methods and observe that our approach significantly enhances coverage of the molecular space and outperforms on distribution learning benchmarks.
Poster: pdf
Submission Number: 103
Loading