Model-Free Robust Average-Reward Reinforcement LearningDownload PDFOpen Website

Published: 01 Jan 2023, Last Modified: 29 Sept 2023ICML 2023Readers: Everyone
Abstract: Robust Markov decision processes (MDPs) address the challenge of model uncertainty by optimizing the worst-case performance over an uncertainty set of MDPs. In this paper, we focus on the robust av...
0 Replies

Loading