Addressing Optimism Bias in Sequence Modeling for Reinforcement LearningDownload PDFOpen Website

Published: 01 Jan 2022, Last Modified: 16 May 2023ICML 2022Readers: Everyone
Abstract: Impressive results in natural language processing (NLP) based on the Transformer neural network architecture have inspired researchers to explore viewing offline reinforcement learning (RL) as a ge...
0 Replies

Loading