2021 (modified: 24 Feb 2022)ICML 2021Readers: Everyone
Abstract:Policies for partially observed Markov decision processes can be efficiently learned by imitating expert policies generated using asymmetric information. Unfortunately, existing approaches for this...