History Compression via Language Models in Reinforcement LearningDownload PDFOpen Website

Published: 01 Jan 2022, Last Modified: 09 May 2023ICML 2022Readers: Everyone
Abstract: In a partially observable Markov decision process (POMDP), an agent typically uses a representation of the past to approximate the underlying MDP. We propose to utilize a frozen Pretrained Language...
0 Replies

Loading