Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning

Adam R. Villaflor, Zhe Huang, Swapnil Pande, John M. Dolan, Jeff Schneider

Published: 2022, Last Modified: 16 May 2023ICML 2022Readers: Everyone

Abstract: Impressive results in natural language processing (NLP) based on the Transformer neural network architecture have inspired researchers to explore viewing offline reinforcement learning (RL) as a ge...

0 Replies