Published: 01 Jan 2023, Last Modified: 19 Sept 2023ICML 2023Readers: Everyone
Abstract:Algorithms for safely improving policies are important to deploy reinforcement learning approaches in real-world scenarios. In this work, we propose an algorithm, called MCTS-SPIBB, that computes s...