ShiftNorm: On Data Efficiency in Reinforcement Learning with Shift Normalization

04 Mar 2022, 07:18 (edited 16 Apr 2022) | ICLR 2022 GPL Poster | Readers: Everyone
  • Keywords: Reinforcement Learning, Data Augmentation, Self-Supervised Learning, Representation Learning
  • TL;DR: We focus on high-dimensional image-based reinforcement learning and introduce ShiftNorm, a reparameterized data shift method that integrates invariant representations with model-free RL methods.
  • Abstract: We propose ShiftNorm, a simple yet promising data augmentation that can be applied to standard model-free algorithms to improve sample efficiency in high-dimensional image-based reinforcement learning (RL). Concretely, the differentiable ShiftNorm leverages original samples together with reparameterized virtual samples and encourages the image encoder to generate invariant representations. Our approach demonstrates substantial advances, outperforming the state of the art on 8 of 9 tasks on the DeepMind Control Suite at 500k steps.
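The abstract does not give implementation details, so the following is only a minimal sketch of one possible reading of a "reparameterized, differentiable shift", not the authors' method. It assumes the shift is drawn as `mu + sigma * eps` with `eps ~ N(0, 1)` and applied by bilinear interpolation, so the output varies smoothly with the shift. The function names `sample_shift` and `shift_image`, the parameters `mu` and `sigma`, and the edge-replication padding are all hypothetical choices made for illustration. Pure Python is used for readability; a real implementation would presumably rely on an automatic differentiation framework.

```python
import math
import random


def sample_shift(mu, sigma, rng):
    """Reparameterized sample (assumed form): shift = mu + sigma * eps."""
    eps = rng.gauss(0.0, 1.0)  # noise is independent of mu and sigma
    return mu + sigma * eps


def shift_image(img, dx, dy):
    """Translate a 2D image (list of rows) by a fractional (dx, dy).

    Uses bilinear interpolation with edge replication, so each output
    pixel is a piecewise-linear function of the shift.
    """
    h, w = len(img), len(img[0])

    def px(y, x):
        # Clamp coordinates to the image border (edge replication).
        y = min(max(y, 0), h - 1)
        x = min(max(x, 0), w - 1)
        return img[y][x]

    out = []
    for y in range(h):
        row = []
        for x in range(w):
            sx, sy = x - dx, y - dy            # source coordinates
            x0, y0 = math.floor(sx), math.floor(sy)
            fx, fy = sx - x0, sy - y0          # fractional parts in [0, 1)
            top = (1 - fx) * px(y0, x0) + fx * px(y0, x0 + 1)
            bot = (1 - fx) * px(y0 + 1, x0) + fx * px(y0 + 1, x0 + 1)
            row.append((1 - fy) * top + fy * bot)
        out.append(row)
    return out


if __name__ == "__main__":
    rng = random.Random(0)
    original = [[1, 2, 3], [4, 5, 6]]
    dx = sample_shift(0.0, 1.0, rng)
    dy = sample_shift(0.0, 1.0, rng)
    virtual = shift_image(original, dx, dy)  # "virtual" sample paired with the original
    print(dx, dy, virtual)
```

Under this reading, an encoder would be trained on both `original` and `virtual`, with a loss that pulls their representations together; that pairing is also an assumption, since the abstract only states that original and virtual samples are used jointly.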