Self-supervised Color Generalization in Reinforcement Learning

Published: 28 Oct 2024, Last Modified: 28 Oct 2024Accepted by TMLREveryoneRevisionsBibTeXCC BY 4.0
Abstract: A challenge in reinforcement learning lies in effectively deploying trained policies to handle out-of-distribution data and environmental variations. Agents observing pixel-based image data are generally sensitive to background distractions and color changes. Commonly, color generalization is achieved through data augmentation. In contrast, we propose a color-invariant neural network layer that adopts distinct color symmetries in a self-supervised fashion. This allows for color sensitivity while achieving generalization. Our approach is based on dynamic-mode decomposition, which also accommodates spatial and temporal symmetries; we discuss the controlled breaking of the latter. We empirically evaluate our method in the Minigrid, Procgen, and DeepMind Control suites and find improved color sensitivity and generalisation.
Submission Length: Regular submission (no more than 12 pages of main content)
Previous TMLR Submission Url: https://openreview.net/forum?id=sF3KidMleK&referrer=%5BAuthor%20Console%5D(%2Fgroup%3Fid%3DTMLR%2FAuthors%23your-submissions)
Changes Since Last Submission: \cite replaced by \citep
Code: https://github.com/matthias-weissenbacher/CiL
Supplementary Material: zip
Assigned Action Editor: ~Oleg_Arenz1
Submission Number: 2953
Loading