OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Go to
OpenReview Public Article DBLP
homepage
Mitigating OOD overoptimism via in-sample value function in offline reinforcement learning
Wenhui Liu
,
Kangyang Luo
,
Zhijian Wu
,
Shanfeng Hao
,
Dingjiang Huang
Published: 2026, Last Modified: 07 May 2026
Neural Networks 2026
Everyone
Revisions
BibTeX
CC BY-SA 4.0
External IDs:
dblp:journals/nn/LiuLWHH26
Loading