Published: 2023, Last Modified: 29 Sept 2023AISTATS 2023Readers: Everyone
Abstract:We study model-free reinforcement learning (RL) algorithms in episodic non-stationary constrained Markov decision processes (CMDPs), in which an agent aims to maximize the expected cumulative rewar...