In reinforcement learning, all objective functions are not equal

Romain Laroche \& Harm van Seijen

In reinforcement learning, all objective functions are not equal

Romain Laroche \& Harm van Seijen,

12 Feb 2018 (modified: 05 May 2023)ICLR 2018 Workshop SubmissionReaders: Everyone

Abstract: We study the learnability of value functions. We get the reward back propagation out of the way by fitting directly a deep neural network on the analytically computed optimal value function, given a chosen objective function. We show that some objective functions are easier to train than others by several magnitude orders. We observe in particular the influence of the $\gamma$ parameter and the decomposition of the task into subtasks.

TL;DR: In reinforcement learning, all objective functions are not equal

Keywords: reinforcement learning, deep learning

3 Replies

Loading