Recurrent Independent Mechanisms

Anirudh Goyal; Alex Lamb; Shagun Sodhani; Jordan Hoffmann; Sergey Levine; Yoshua Bengio; Bernhard Scholkopf

25 Sept 2019 (modified: 26 May 2025) | ICLR 2020 Conference Blind Submission | Readers: Everyone

TL;DR: Learning recurrent mechanisms that operate independently and interact only sparingly can lead to better generalization to out-of-distribution samples.

Abstract: Learning modular structures which reflect the dynamics of the environment can lead to better generalization and robustness to changes which only affect a few of the underlying causes. We propose Recurrent Independent Mechanisms (RIMs), a new recurrent architecture in which multiple groups of recurrent cells operate with nearly independent transition dynamics, communicate only sparingly through the bottleneck of attention, and are only updated at time steps where they are most relevant. We show that this leads to specialization amongst the RIMs, which in turn allows for dramatically improved generalization on tasks where some factors of variation differ systematically between training and evaluation.
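Illustrative sketch: the abstract names three ingredients: groups of recurrent cells with their own transition dynamics, sparse communication through attention, and updates only at the time steps where a group is most relevant. The NumPy code below is one minimal way to read that description, not the authors' implementation. The function names (`rim_step`, `init_params`), the parameter names, the plain tanh recurrence, the null-input slot, and the top-k selection rule are all assumptions made for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def init_params(n, d, d_in, d_att, seed=0):
    """Hypothetical parameter layout: n modules, hidden size d per module."""
    rng = np.random.default_rng(seed)
    shapes = {
        "Wq": (n, d, d_att),  # per-module query projection (input attention)
        "Wk": (d_in, d_att),  # shared key projection of the input
        "Wv": (d_in, d),      # shared value projection of the input
        "Wh": (n, d, d),      # per-module recurrent weights (independent dynamics)
        "Cq": (d, d_att),     # communication attention: queries
        "Ck": (d, d_att),     # communication attention: keys
        "Cv": (d, d),         # communication attention: values
    }
    return {name: 0.3 * rng.normal(size=s) for name, s in shapes.items()}

def rim_step(h, x, params, k_active):
    """One time step. h: (n, d) module states, x: (d_in,) input vector."""
    n, _ = h.shape
    # 1. Input attention: each module chooses between the real input and a
    #    null (all-zero) input, so it can opt out of reading.
    inputs = np.stack([x, np.zeros_like(x)])            # (2, d_in)
    q = np.einsum("nd,nde->ne", h, params["Wq"])        # (n, d_att)
    keys = inputs @ params["Wk"]                        # (2, d_att)
    vals = inputs @ params["Wv"]                        # (2, d)
    att = softmax(q @ keys.T / np.sqrt(keys.shape[1]))  # (n, 2)
    read = att @ vals                                   # (n, d)

    # 2. Only the k modules that attend most to the real input are updated.
    active = np.argsort(-att[:, 0])[:k_active]
    mask = np.zeros(n, dtype=bool)
    mask[active] = True

    # 3. Independent dynamics: each module applies its own weights.
    #    (A plain tanh recurrence stands in for whatever cell is used.)
    cand = np.tanh(np.einsum("nd,nde->ne", h, params["Wh"]) + read)

    # 4. Sparse communication: modules attend over all candidate states.
    cq, ck, cv = (cand @ params[p] for p in ("Cq", "Ck", "Cv"))
    comm = softmax(cq @ ck.T / np.sqrt(ck.shape[1])) @ cv
    updated = cand + comm

    # Inactive modules keep their previous state unchanged.
    return np.where(mask[:, None], updated, h), mask
```

A call such as `rim_step(h, x, init_params(4, 8, 5, 6), k_active=2)` returns the new states together with a boolean mask of the modules that were updated. Rows of the state matrix outside the mask are copied through untouched, which is the "only updated where most relevant" behavior in this reading.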

Keywords: modular representations, better generalization, learning mechanisms

Community Implementations: [4 code implementations](https://www.catalyzex.com/paper/recurrent-independent-mechanisms/code)

