FOP: Factorizing Optimal Joint Policy of Maximum-Entropy Multi-Agent Reinforcement LearningDownload PDFOpen Website

2021 (modified: 31 Mar 2022)ICML 2021Readers: Everyone
Abstract: Value decomposition recently injects vigorous vitality into multi-agent actor-critic methods. However, existing decomposed actor-critic methods cannot guarantee the convergence of global optimum. I...
0 Replies

Loading