2019 (modified: 06 Nov 2022)ACML 2019Readers: Everyone
Abstract:Leveraging an equivalence property in the state-space of a Markov Decision Process (MDP) has been investigated in several studies. This paper studies equivalence structure in the reinforcement lear...