Implicit Offline Reinforcement Learning via Supervised Learning

Alexandre Piché; Rafael Pardinas; David Vazquez; Igor Mordatch; Christopher Pal

Published: 01 Feb 2023, Last Modified: 13 Feb 2023

Submitted to ICLR 2023

Readers: Everyone

Keywords: Offline Reinforcement Learning, Energy Based Model, Offline Reinforcement Learning via Supervised Learning

TL;DR: This work bridges an essential gap between implicit models and explicit RL via Supervised Learning methods.

Abstract: Offline Reinforcement Learning (RL) via Supervised Learning is a simple and effective way to learn robotic skills from a dataset of varied behaviors. It is as simple as supervised learning and Behavior Cloning (BC) but takes advantage of the return information. On BC tasks, implicit models have been shown to match or outperform explicit ones. Despite the benefits of using implicit models to learn robotic skills via BC, Offline RL via Supervised Learning algorithms have been limited to explicit models. We show how implicit models leverage return information and match or outperform explicit algorithms to acquire robotic skills from fixed datasets. Furthermore, we show how closely related our implicit methods are to other popular RL via Supervised Learning algorithms.
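The distinction the abstract draws between explicit and implicit return-conditioned policies can be illustrated with a minimal sketch. This is not the authors' implementation: the quadratic `toy_energy` function stands in for a learned energy network E(s, a, R), and the names `toy_energy`, `implicit_policy`, and `explicit_policy`, the uniform candidate sampling, and the scalar state and action are all assumptions made for illustration. An explicit policy maps a state and a target return directly to an action, whereas an implicit policy scores candidate actions with an energy function and picks the one with the lowest energy.

```python
import random


def toy_energy(state, action, target_return):
    """Hypothetical energy: low when the action suits the state and target return.

    Stands in for a learned network E(s, a, R); the quadratic form is an
    assumption chosen only so the example is easy to check by hand.
    """
    preferred_action = state * target_return
    return (action - preferred_action) ** 2


def implicit_policy(energy_fn, state, target_return,
                    num_samples=256, low=-1.0, high=1.0, seed=0):
    """Implicit policy: return the lowest-energy action among sampled candidates."""
    rng = random.Random(seed)
    candidates = [rng.uniform(low, high) for _ in range(num_samples)]
    return min(candidates, key=lambda a: energy_fn(state, a, target_return))


def explicit_policy(state, target_return):
    """Explicit counterpart: map (state, target return) straight to an action."""
    return state * target_return


if __name__ == "__main__":
    s, R = 0.5, 0.8
    print("implicit action:", implicit_policy(toy_energy, s, R))
    print("explicit action:", explicit_policy(s, R))
```

In this toy setting both policies agree up to the resolution of the sampled candidates. The implicit form can additionally represent multi-modal action distributions, since an energy function may have several minima, which is one commonly cited motivation for implicit models in Behavior Cloning.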

Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors’ identity.

No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics.

Submission Guidelines: Yes

Please Choose The Closest Area That Your Submission Falls Into: Reinforcement Learning (eg, decision and control, planning, hierarchical RL, robotics)

Supplementary Material: zip
