Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble

2021 (modified: 18 Mar 2022), CoRL 2021, Readers: Everyone
Abstract: Recent advances in deep offline reinforcement learning (RL) have made it possible to train strong robotic agents from offline datasets. However, depending on the quality of the trained agents and the...
