Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning

Lipeng Wan, Zeyang Liu, Xingyu Chen, Xuguang Lan, Nanning Zheng

2022 (modified: 01 Nov 2022), ICML 2022, Readers: Everyone

Abstract: Due to the representation limitation of the joint Q value function, multi-agent reinforcement learning methods with linear value decomposition (LVD) or monotonic value decomposition (MVD) suffer fr...
