2017 (modified: 11 Nov 2022)ICML 2017Readers: Everyone
Abstract:We propose a method for learning expressive energy-based policies for continuous states and actions, which has been feasible only in tabular domains before. We apply our method to learning maximum ...