Constrained Upper Confidence Reinforcement Learning

2020 (modified: 17 Apr 2023), L4DC 2020, Readers: Everyone
Abstract: Constrained Markov Decision Processes are a class of stochastic decision problems in which the decision maker must select a policy that satisfies auxiliary cost constraints. This paper extends uppe...