2021 (modified: 13 Mar 2022)ICML 2021Readers: Everyone
Abstract:In this paper, we provide finite-sample convergence guarantees for an off-policy variant of the natural actor-critic (NAC) algorithm based on Importance Sampling. In particular, we show that the al...