2021 (modified: 28 Feb 2022)ICML 2021Readers: Everyone
Abstract:Batch policy optimization considers leveraging existing data for policy construction before interacting with an environment. Although interest in this problem has grown significantly in recent year...