2022 (modified: 05 Oct 2022)ICML 2022Readers: Everyone
Abstract:Approaches to policy optimization have been motivated from diverse principles, based on how the parametric model is interpreted (e.g. value versus policy representation) or how the learning objecti...