AceGen:  A TorchRL-based toolkit for reinforcement learning in generative chemistry

Albert Bou; Morgan Thomas; Sebastian Dittert; Carles Navarro Ramírez; Maciej Majewski; Ye Wang; Shivam Patel; Gary Tresadern; Mazen Ahmad; Vincent Moens; Woody Sherman; simone sciabola; Gianni De Fabritiis

AceGen: A TorchRL-based toolkit for reinforcement learning in generative chemistry

Albert Bou, Morgan Thomas, Sebastian Dittert, Carles Navarro Ramírez, Maciej Majewski, Ye Wang, Shivam Patel, Gary Tresadern, Mazen Ahmad, Vincent Moens, Woody Sherman, simone sciabola, Gianni De Fabritiis

Published: 04 Mar 2024, Last Modified: 29 Apr 2024GEM PosterEveryoneRevisionsBibTeXCC BY 4.0

Track: Machine learning: computational method and/or computational results

Keywords: Reinforcement Learning, Drug Discovery, Drug Design, TorchRL

TL;DR: A toolkit for drug design using reinforcement learning and TorchRL, offering simplified and more efficient implementations of known algorithms and a more sample efficient algorithm for language-based for de novo drug discovery.

Abstract: In recent years, reinforcement learning (RL) has been increasingly used in drug design to propose molecules with specific properties under defined constraints. However, RL problems are inherently complex, featuring independent and interchangeable components with diverse method signatures and data requirements, leading existing applications to convoluted code structures. This complexity not only complicates code comprehension but also hampers modification, hindering the smooth exploration of new ideas in the field and ultimately slowing down research. In this work, we apply TorchRL - a modern general decision-making library that provides well-integrated reusable components - to make a robust toolkit tailored for generative drug design. AceGen leverages general RL solutions which enhance simplicity, making the solutions more understandable, modifiable, and reliable. We demonstrate the application of AceGen for conditioned compound library generation implementing various RL algorithms to optimize drug design targets. Furthermore, with the tools made available we propose a novel algorithm inspired by the PPOD algorithm that outperforms all baselines as benchmarked on 23 drug design relevant targets. The library is accessible at https://anonymous.4open.science/r/acegen-open-23D3.

Submission Number: 71

Loading