Hadamard Representations: Augmenting Hyperbolic Tangents in RL

Jacob Eeuwe Kooi; Mark Hoogendoorn; Vincent Francois-Lavet

Hadamard Representations: Augmenting Hyperbolic Tangents in RL

Jacob Eeuwe Kooi, Mark Hoogendoorn, Vincent Francois-Lavet

Published: 01 Aug 2024, Last Modified: 09 Oct 2024EWRL17EveryoneRevisionsBibTeXCC BY 4.0

Keywords: Representation Learning, Reinforcement Learning, Activations

Abstract: Activation functions are one of the key components of a deep neural network. The most commonly used activation functions can be classed into the category of continuously differentiable (e.g. tanh) and linear-unit functions (e.g. ReLU), both having their own strengths and drawbacks with respect to downstream performance and representation capacity through learning (e.g. measured by the number of dead neurons and the effective rank). In reinforcement learning, the performance of continuously differentiable activations often falls short as compared to linear-unit functions. We provide insights into the vanishing gradients associated with the former, and show that the dying neuron problem is not exclusive to ReLU's. To alleviate vanishing gradients and the resulting dying neuron problem occurring with continuously differentiable activations, we propose a Hadamard representation. Using deep Q-networks and proximal policy optimization in the Atari domain, we show faster learning, a reduction in dead neurons and increased effective rank.

Submission Number: 133

Loading