Recurrent Independent Mechanisms

Anirudh Goyal; Alex Lamb; Jordan Hoffmann; Shagun Sodhani; Sergey Levine; Yoshua Bengio; Bernhard Schölkopf

Published: 12 Jan 2021, Last Modified: 22 Jun 2025. ICLR 2021 Spotlight. Readers: Everyone

Keywords: modular representations, better generalization, learning mechanisms

Abstract: We explore the hypothesis that learning modular structures which reflect the dynamics of the environment can lead to better generalization and robustness to changes that only affect a few of the underlying causes. We propose Recurrent Independent Mechanisms (RIMs), a new recurrent architecture in which multiple groups of recurrent cells operate with nearly independent transition dynamics, communicate only sparingly through the bottleneck of attention, and compete with each other so they are updated only at time steps where they are most relevant. We show that this leads to specialization amongst the RIMs, which in turn allows for remarkably improved generalization on tasks where some factors of variation differ systematically between training and evaluation.
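The mechanism described in the abstract (separate dynamics per module, competition for activation, sparse communication through attention) can be illustrated with a minimal NumPy sketch of a single time step. This is a simplified illustration under stated assumptions, not the authors' implementation: it uses a plain tanh recurrence and single-head dot-product attention, and every name in it (`rim_step`, `init_params`, `top_k`, the weight keys) is hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def init_params(num_rims, d_in, d_h, d_k, rng):
    """Random weights; each mechanism gets its own slice (hypothetical layout)."""
    def w(*shape):
        return rng.normal(scale=0.1, size=shape)
    return {
        "Wq_in": w(num_rims, d_h, d_k),  # per-mechanism queries for input attention
        "Wk_in": w(d_in, d_k),           # keys for the input candidates
        "Wv_in": w(d_in, d_h),           # values for the input candidates
        "W_x": w(num_rims, d_h, d_h),    # per-mechanism input-to-hidden weights
        "W_h": w(num_rims, d_h, d_h),    # per-mechanism hidden-to-hidden weights
        "Wq_c": w(num_rims, d_h, d_k),   # communication queries
        "Wk_c": w(num_rims, d_h, d_k),   # communication keys
        "Wv_c": w(num_rims, d_h, d_h),   # communication values
    }

def rim_step(x, h, p, top_k):
    """One step. x: (d_in,), h: (num_rims, d_h). Returns (new_h, active_mask)."""
    d_k = p["Wk_in"].shape[1]

    # 1. Input attention: each mechanism attends over {real input, null input}.
    candidates = np.stack([x, np.zeros_like(x)])            # (2, d_in)
    q = np.einsum("kh,khd->kd", h, p["Wq_in"])              # (K, d_k)
    keys = candidates @ p["Wk_in"]                          # (2, d_k)
    vals = candidates @ p["Wv_in"]                          # (2, d_h)
    attn = softmax(q @ keys.T / np.sqrt(d_k), axis=-1)      # (K, 2)

    # 2. Competition: only the top_k mechanisms with the most attention
    #    on the real input are activated at this step.
    active = np.zeros(h.shape[0], dtype=bool)
    active[np.argsort(-attn[:, 0])[:top_k]] = True

    # 3. Independent dynamics: each mechanism uses only its own weights.
    read = attn @ vals                                      # (K, d_h)
    proposal = np.tanh(
        np.einsum("kh,khj->kj", read, p["W_x"])
        + np.einsum("kh,khj->kj", h, p["W_h"])
    )
    h_tilde = np.where(active[:, None], proposal, h)        # inactive: unchanged

    # 4. Sparse communication: mechanisms read from all mechanisms via
    #    attention; the result is applied only to the active ones below.
    qc = np.einsum("kh,khd->kd", h_tilde, p["Wq_c"])
    kc = np.einsum("kh,khd->kd", h_tilde, p["Wk_c"])
    vc = np.einsum("kh,khj->kj", h_tilde, p["Wv_c"])
    comm = softmax(qc @ kc.T / np.sqrt(d_k), axis=-1) @ vc  # (K, d_h)

    new_h = np.where(active[:, None], h_tilde + comm, h)
    return new_h, active
```

In this sketch the states of inactive mechanisms pass through the step untouched, which is one simple way to realize the idea that a mechanism is updated only at time steps where it is relevant.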

One-sentence Summary: Learning recurrent mechanisms that operate independently and interact sparingly can lead to better generalization to out-of-distribution samples.

Code: [3 community implementations (Papers with Code)](https://paperswithcode.com/paper/?openreview=mLcmdlEUxy-)

Community Implementations: [4 code implementations (CatalyzeX)](https://www.catalyzex.com/paper/recurrent-independent-mechanisms/code)