Using adaptive intrinsic motivation in RL to model learning across development

Kai Jappe Sandbrink; Brian Christian; Linas Nasvytis; Christian Schroeder de Witt; Patrick Butlin

Using adaptive intrinsic motivation in RL to model learning across development

Kai Jappe Sandbrink, Brian Christian, Linas Nasvytis, Christian Schroeder de Witt, Patrick Butlin

Published: 09 Oct 2024, Last Modified: 02 Dec 2024NeurIPS 2024 Workshop IMOL PosterEveryoneRevisionsBibTeXCC BY 4.0

Track: Full track

Keywords: reinforcement learning, intrinsic motivation, meta-RL, single-life RL

Abstract: Reinforcement learning is a powerful model of animal learning in brief, controlled experimental conditions, but does not readily explain the development of behavior over an animal's whole lifetime. In this paper, we describe a framework to address this shortcoming by introducing the single-life reinforcement learning setting to cognitive science. We construct an agent with two learning systems: an extrinsic learner that learns within a single lifetime, and an intrinsic learner that learns across lifetimes, equipping the agent with intrinsic motivation. We show that this model outperforms heuristic benchmarks and recapitulates a transition from exploratory to habit-driven behavior, while allowing the agent to learn an interpretable value function. We formulate a precise definition of intrinsic motivation and discuss the philosophical implications of using reinforcement learning as a model of behavior in the real world.

Submission Number: 46

Loading