Meta Learning Backpropagation And Improving It

Louis Kirsch; Jürgen Schmidhuber

Meta Learning Backpropagation And Improving It

Louis Kirsch, Jürgen Schmidhuber

Published: 09 Nov 2021, Last Modified: 05 May 2023NeurIPS 2021 PosterReaders: Everyone

Keywords: meta-learning, general-purpose meta-learning, learned learning rules, fast weights, distributed memory, backpropagation, gradient descent, modularity, self-organization

Abstract: Many concepts have been proposed for meta learning with neural networks (NNs), e.g., NNs that learn to reprogram fast weights, Hebbian plasticity, learned learning rules, and meta recurrent NNs. Our Variable Shared Meta Learning (VSML) unifies the above and demonstrates that simple weight-sharing and sparsity in an NN is sufficient to express powerful learning algorithms (LAs) in a reusable fashion. A simple implementation of VSML where the weights of a neural network are replaced by tiny LSTMs allows for implementing the backpropagation LA solely by running in forward-mode. It can even meta learn new LAs that differ from online backpropagation and generalize to datasets outside of the meta training distribution without explicit gradient calculation. Introspection reveals that our meta learned LAs learn through fast association in a way that is qualitatively different from gradient descent.

Code Of Conduct: I certify that all co-authors of this work have read and commit to adhering to the NeurIPS Statement on Ethics, Fairness, Inclusivity, and Code of Conduct.

TL;DR: Implementing backpropagation in recurrent neural networks and discovering novel general-purpose learning algorithms.

Supplementary Material: pdf

Code: http://louiskirsch.com/code/vsml

15 Replies

Loading