Representation-Driven Reinforcement Learning

Ofir Nabati; Guy Tennenholtz; Shie Mannor

Representation-Driven Reinforcement Learning

Ofir Nabati, Guy Tennenholtz, Shie Mannor

Published: 20 Jul 2023, Last Modified: 31 Aug 2023EWRL16Readers: Everyone

Keywords: reinforcement learning, representation learning, exploration

TL;DR: Employing linear bandits with a learned linear policy representation for solving reinforcement learning problems.

Abstract: We present a representation-driven framework for reinforcement learning. By representing policies as estimates of their expected values, we leverage techniques from contextual bandits to guide exploration and exploitation. Particularly, embedding a policy network into a linear feature space allows us to reframe the exploration-exploitation problem as a representation-exploitation problem, where good policy representations enable optimal exploration. We demonstrate the effectiveness of this framework through its application to evolutionary and policy gradient-based approaches, leading to significantly improved performance compared to traditional methods. Our framework provides a new perspective on reinforcement learning, highlighting the importance of policy representation in determining optimal exploration-exploitation strategies.

Already Accepted Paper At Another Venue: already accepted somewhere else

1 Reply

Loading