Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning

2022 (modified: 01 Nov 2022), ICML 2022, Readers: Everyone
Abstract: Due to the representation limitation of the joint Q value function, multi-agent reinforcement learning methods with linear value decomposition (LVD) or monotonic value decomposition (MVD) suffer fr...