Scalable Safe Policy Improvement via Monte Carlo Tree SearchDownload PDFOpen Website

Published: 01 Jan 2023, Last Modified: 19 Sept 2023ICML 2023Readers: Everyone
Abstract: Algorithms for safely improving policies are important to deploy reinforcement learning approaches in real-world scenarios. In this work, we propose an algorithm, called MCTS-SPIBB, that computes s...
0 Replies

Loading