How to Reward Your Drug Agent?

Andrea Karlova; Wim Dehaen; Andrei Penciu

How to Reward Your Drug Agent?

Andrea Karlova, Wim Dehaen, Andrei Penciu

Published: 08 Dec 2021, Last Modified: 05 May 2023DGMs and Applications @ NeurIPS 2021 PosterReaders: Everyone

Keywords: reinforcement learning in drug discovery, choice of reward function, novel molecular structures, molecular optimisation, molecular generation

TL;DR: DAFT: highly customisable reward function for generating realistic molecular structures for various drug applications

Abstract: Constructing novel molecules from scratch using deep generative models provides useful alternative to traditional virtual screening methods which are limited to the search of the already discovered chemicals. In particular, molecular optimisation combined with sampling guided by reinforcement learning seems like a promising path for discovering novel molecular designs and allows for domain-specific customization of the desired solutions. The choice of a chemically relevant reward function and the exhaustive assessment of its properties remains a challenging task. We introduce the reward function which gives enough flexibility to quantify the biological activity with respect to a selected protein target, drug-likeness, synthesizability and incorporates the custom index of penalised physico-chemical properties. In order to customise the hyper-parameters influencing the RL agent performance, we propose the methodology which helps to quantify the chemical relevance of the reward function by quantifying the chemical relevance of the samples. We assess the performance of the reward function by docking the molecules with relevant protein targets and quantify the difference with the ground truth samples using Wasserstein distance.

1 Reply

Loading