Provably Efficient Model-Free Algorithms for Non-stationary CMDPsDownload PDFOpen Website

Published: 2023, Last Modified: 29 Sept 2023AISTATS 2023Readers: Everyone
Abstract: We study model-free reinforcement learning (RL) algorithms in episodic non-stationary constrained Markov decision processes (CMDPs), in which an agent aims to maximize the expected cumulative rewar...
0 Replies

Loading