Off-Belief Learning

2021 (modified: 16 May 2022), ICML 2021
Abstract: The standard problem setting in Dec-POMDPs is self-play, where the goal is to find a set of policies that play optimally together. Policies learned through self-play may adopt arbitrary conventions...