Growing Up Together: Structured Exploration for Large Action Spaces

Gabriel Synnaeve; Jonas Gehring; Zeming Lin; Daniel Haziza; Nicolas Usunier; Danielle Rothermel; Vegard Mella; Da Ju; Nicolas Carion; Laura Gustafson; Daniel Gant

Growing Up Together: Structured Exploration for Large Action Spaces

Gabriel Synnaeve, Jonas Gehring, Zeming Lin, Daniel Haziza, Nicolas Usunier, Danielle Rothermel, Vegard Mella, Da Ju, Nicolas Carion, Laura Gustafson, Daniel Gant

25 Sept 2019 (modified: 05 May 2023)ICLR 2020 Conference Withdrawn SubmissionReaders: Everyone

Abstract: Training good policies for large combinatorial action spaces is onerous and usually tackled with imitation learning, curriculum learning, or reward shaping. Each of these methods has requirements that can hinder their general application. Here, we study how growing the action space of the policy during training can structure the exploration and lead to convergence without any external data (imitation), with less control over the environment (curriculum), and with minimal reward shaping. We evaluate this approach on a challenging end-to-end full games army control task in StarCraft: Brood War by training policies through self-play from scratch. We grow the spatial resolution and frequency of actions and achieve superior results compared to operating purely at finer resolutions.

Keywords: Reinforcement Learning, Real-Time Strategy Games, Hierarchical RL, Large Action Space

Original Pdf: pdf

1 Reply

Loading