2022 (modified: 01 Nov 2022)ICML 2022Readers: Everyone
Abstract:Due to the representation limitation of the joint Q value function, multi-agent reinforcement learning methods with linear value decomposition (LVD) or monotonic value decomposition (MVD) suffer fr...