Lifting the Veil on Hyper-parameters for Value-based Deep Reinforcement Learning

João Guilherme Madeira Araújo; Johan Samir Obando Ceron; Pablo Samuel Castro

Lifting the Veil on Hyper-parameters for Value-based Deep Reinforcement Learning

João Guilherme Madeira Araújo, Johan Samir Obando Ceron, Pablo Samuel Castro

12 Oct 2021 (modified: 05 May 2023)Deep RL Workshop NeurIPS 2021Readers: Everyone

Keywords: Reinforcement Learning, Deep Reinforcement Learning, Value based

TL;DR: We conduct a thorough investigation of many, often overlooked, hyperparameters used in value-based deep RL

Abstract: Successful applications of deep reinforcement learning (deep RL) combine algorithmic design and careful hyper-parameter selection. The former often comes from iterative improvements over existing algorithms, while the latter is either inherited from prior methods or tuned for the specific method being introduced. Although critical to a method's performance, the effect of the various hyper-parameter choices are often overlooked in favour of algorithmic advances. In this paper, we perform an initial empirical investigation into a number of often-overlooked hyper-parameters for value-based deep RL agents, demonstrating their varying levels of importance. We conduct this study on a varied set of classic control environments which helps highlight the effect each environment has on an algorithm's hyper-parameter sensitivity.

Supplementary Material: zip

0 Replies

Loading