SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-view Total Correlation

Published: 26 Apr 2024, Last Modified: 15 Jul 2024, UAI 2024 poster, CC BY 4.0
Keywords: reinforcement learning, visual control, multi-view total correlation
Abstract: The advent of abundant image data has catalyzed the advancement of visual control in reinforcement learning (RL) systems, leveraging multiple viewpoints to capture the same physical states, which could enhance control performance theoretically. However, integrating multi-view data into representation learning remains challenging. In this paper, we introduce SMuCo, an innovative multi-view reinforcement learning algorithm that constructs robust latent representations by optimizing multi-view sequential total correlation. This technique effectively captures task-relevant information and temporal dynamics while filtering out irrelevant data. Our method supports an unlimited number of views and demonstrates superior performance over leading model-free and model-based RL algorithms. Empirical results from the DeepMind Control Suite and the Sapien Basic Manipulation Task confirm SMuCo’s enhanced efficacy, significantly improving task performance across diverse scenarios and views.
List Of Authors: Cheng, Tong and Dong, Hang and Wang, Lu and Qiao, Bo and Lin, Qingwei and Rajmohan, Saravan and Moscibroda, Thomas
Submission Number: 41