Published: 01 Jan 2022, Last Modified: 09 May 2023ICML 2022Readers: Everyone
Abstract:In a partially observable Markov decision process (POMDP), an agent typically uses a representation of the past to approximate the underlying MDP. We propose to utilize a frozen Pretrained Language...