Constrained Variational Policy Optimization for Safe Reinforcement LearningDownload PDFOpen Website

2022 (modified: 16 Nov 2022)ICML 2022Readers: Everyone
Abstract: Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before deploying them to safety-critical applications. Previous primal-dual style approaches suffer from ins...
0 Replies

Loading