2021 (modified: 15 Nov 2022), UAI 2021, Readers: Everyone
Abstract: A core challenge in policy optimization in competitive Markov decision processes is the design of efficient optimization methods with desirable convergence and stability properties. We propose comp...