Policy Regularization with Dataset Constraint for Offline Reinforcement Learning

Yuhang Ran, Yi-Chen Li, Fuxiang Zhang, Zongzhang Zhang, Yang Yu

Published: 2023, Last Modified: 04 Sept 2023, ICML 2023, Readers: Everyone

Abstract: We consider the problem of learning the best possible policy from a fixed dataset, known as offline Reinforcement Learning (RL). A common taxonomy of existing offline RL works is policy regularizat...
