Keywords: reinforcement learning, transfer learning, meta learning, successor features
Abstract: Transfer in Reinforcement Learning aims to improve learning performance on target tasks using knowledge from experienced source tasks. Successor features (SF) are a prominent transfer mechanism in domains where the reward function changes between tasks. They reevaluate the expected return of previously learned policies in a new target task in order to transfer their knowledge. A limiting factor of the SF framework is its assumption that rewards decompose linearly into successor features and a reward weight vector. We propose a novel SF mechanism, $\xi$-learning, based on learning the cumulative discounted probability of successor features. Crucially, $\xi$-learning makes it possible to reevaluate the expected return of policies for general reward functions. We introduce two $\xi$-learning variations, prove their convergence, and provide a guarantee on transfer performance. Experimental evaluations of $\xi$-learning with function approximation demonstrate its clear advantage over existing mechanisms, not only for general reward functions but also in the case of linearly decomposable reward functions.
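To make the distinction concrete, the following is a minimal sketch (not the authors' implementation) contrasting the standard linear-SF return estimate with a $\xi$-learning-style estimate that weights each feature value's reward by a cumulative discounted probability. It assumes a discrete feature set; the names `psi`, `xi`, `reward_fn`, and `n_features` are illustrative placeholders.

```python
import numpy as np

gamma = 0.99
n_features = 4
n_actions = 3

# Standard SF: psi[a] ~ E[sum_t gamma^t phi_t | s, a, pi].
# The return estimate requires the linear assumption r = phi . w.
psi = np.random.rand(n_actions, n_features)   # toy SF vectors, one per action
w = np.random.rand(n_features)                # linear reward weights
q_linear = psi @ w                            # Q(s, a) under the linear assumption

# xi-learning-style quantity: xi[a, f] ~ sum_t gamma^t P(phi_t = f | s, a, pi),
# i.e. a cumulative discounted probability over a finite set of feature values.
xi = np.random.rand(n_actions, n_features)
xi /= xi.sum(axis=1, keepdims=True) * (1.0 - gamma)  # toy normalisation: rows sum to 1/(1-gamma)

def reward_fn(feature_index: int) -> float:
    """Arbitrary (possibly non-linear) reward defined on feature values."""
    return float(np.sin(feature_index))

# Expected return for a *general* reward: weight each feature value's reward
# by its cumulative discounted probability -- no linear decomposition needed.
rewards = np.array([reward_fn(f) for f in range(n_features)])
q_general = xi @ rewards

print("Q under linear-SF assumption:", q_linear)
print("Q from xi-style estimate:    ", q_general)
```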
One-sentence Summary: A transfer Reinforcement Learning method that allows Successor Features to be used with any reward function, instead of only linearly decomposable ones.
Community Implementations: [1 code implementation](https://www.catalyzex.com/paper/arxiv:2110.15701/code)