Factored Action Spaces in Deep Reinforcement Learning

Thomas PIERROT; Valentin Macé; Jean-Baptiste Sevestre; Louis Monier; Alexandre Laterre; Nicolas Perrin; Karim Beguir; Olivier Sigaud

Factored Action Spaces in Deep Reinforcement Learning

Thomas PIERROT, Valentin Macé, Jean-Baptiste Sevestre, Louis Monier, Alexandre Laterre, Nicolas Perrin, Karim Beguir, Olivier Sigaud

28 Sept 2020 (modified: 05 May 2023)ICLR 2021 Conference Blind SubmissionReaders: Everyone

Keywords: Deep Reinforcement Learning, Large action spaces, Parameterized action spaces, Multi-Agent, Continuous Control

Abstract: Very large action spaces constitute a critical challenge for deep Reinforcement Learning (RL) algorithms. An existing approach consists in splitting the action space into smaller components and choosing either independently or sequentially actions in each dimension. This approach led to astonishing results for the StarCraft and Dota 2 games, however it remains underexploited and understudied. In this paper, we name this approach Factored Actions Reinforcement Learning (FARL) and study both its theoretical impact and practical use. Notably, we provide a theoretical analysis of FARL on the Proximal Policy Optimization (PPO) and Soft Actor Critic (SAC) algorithms and evaluate these agents in different classes of problems. We show that FARL is a very versatile and efficient approach to combinatorial and continuous control problems.

One-sentence Summary: We propose a theoretical study as well as practical tips and applications of action spaces factorization in deep Reinforcement Learning.

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics

Reviewed Version (pdf): https://openreview.net/references/pdf?id=4pWc9uceDlO

8 Replies

Loading