Learning Diverse Policies in MOBA Games via Macro-Goals

Yiming Gao; Bei Shi; Xueying Du; Liang Wang; Guangwei Chen; Zhenjie Lian; Fuhao Qiu; GUOAN HAN; Weixuan Wang; Deheng Ye; QIANG FU; Yang Wei; Lanxiao Huang

Learning Diverse Policies in MOBA Games via Macro-Goals

Yiming Gao, Bei Shi, Xueying Du, Liang Wang, Guangwei Chen, Zhenjie Lian, Fuhao Qiu, GUOAN HAN, Weixuan Wang, Deheng Ye, QIANG FU, Yang Wei, Lanxiao Huang

Published: 09 Nov 2021, Last Modified: 05 May 2023NeurIPS 2021 PosterReaders: Everyone

Keywords: Deep Reinforcement Learning, Diverse Policies, Game Playing, Goal-based Learning

TL;DR: We propose a novel learning paradigm named Macro-Goals Guided (MGG) learning to train diverse policies in MOBA games.

Abstract: Recently, many researchers have made successful progress in building the AI systems for MOBA-game-playing with deep reinforcement learning, such as on Dota 2 and Honor of Kings. Even though these AI systems have achieved or even exceeded human-level performance, they still suffer from the lack of policy diversity. In this paper, we propose a novel Macro-Goals Guided framework, called MGG, to learn diverse policies in MOBA games. MGG abstracts strategies as macro-goals from human demonstrations and trains a Meta-Controller to predict these macro-goals. To enhance policy diversity, MGG samples macro-goals from the Meta-Controller prediction and guides the training process towards these goals. Experimental results on the typical MOBA game Honor of Kings demonstrate that MGG can execute diverse policies in different matches and lineups, and also outperform the state-of-the-art methods over 102 heroes.

Code Of Conduct: I certify that all co-authors of this work have read and commit to adhering to the NeurIPS Statement on Ethics, Fairness, Inclusivity, and Code of Conduct.

Supplementary Material: pdf

18 Replies

Loading