A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

Harry Zhao; Zhen Liu; Sitao Luan; Shuyuan Zhang; Doina Precup; Yoshua Bengio

A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

Harry Zhao, Zhen Liu, Sitao Luan, Shuyuan Zhang, Doina Precup, Yoshua Bengio

12 Oct 2021 (modified: 04 May 2025)Deep RL Workshop NeurIPS 2021Readers: Everyone

Keywords: consciousness, planning, reinforcement learning, deep learning

TL;DR: We introduce into reinforcement learning inductive biases inspired by higher-order cognitive functions. These enable the planning to direct attention dynamically to the interesting parts of the state at each step of imagined future trajectories.

Abstract: We present an end-to-end, model-based deep reinforcement learning agent which dynamically attends to relevant parts of its state, in order to plan and to generalize better out-of-distribution. The agent's architecture uses a set representation and a bottleneck mechanism, forcing the number of entities to which the agent attends at each planning step to be small. In experiments, we investigate the bottleneck mechanism with sets of customized environments featuring different dynamics. We consistently observe that the design allows agents to learn to plan effectively, by attending to the relevant objects, leading to better out-of-distribution generalization.

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 4 code implementations](https://www.catalyzex.com/paper/a-consciousness-inspired-planning-agent-for/code)

0 Replies

Loading