Learning to solve the credit assignment problem

Benjamin James Lansdell; Prashanth Ravi Prakash; Konrad Paul Kording

Learning to solve the credit assignment problem

Benjamin James Lansdell, Prashanth Ravi Prakash, Konrad Paul Kording

Published: 20 Dec 2019, Last Modified: 22 Jun 2025ICLR 2020 Conference Blind SubmissionReaders: Everyone

Keywords: biologically plausible deep learning, node perturbation, REINFORCE, synthetic gradients, feedback alignment

TL;DR: Perturbations can be used to train feedback weights to learn in fully connected and convolutional neural networks

Abstract: Backpropagation is driving today's artificial neural networks (ANNs). However, despite extensive research, it remains unclear if the brain implements this algorithm. Among neuroscientists, reinforcement learning (RL) algorithms are often seen as a realistic alternative: neurons can randomly introduce change, and use unspecific feedback signals to observe their effect on the cost and thus approximate their gradient. However, the convergence rate of such learning scales poorly with the number of involved neurons. Here we propose a hybrid learning approach. Each neuron uses an RL-type strategy to learn how to approximate the gradients that backpropagation would provide. We provide proof that our approach converges to the true gradient for certain classes of networks. In both feedforward and convolutional networks, we empirically show that our approach learns to approximate the gradient, and can match the performance of gradient-based learning. Learning feedback weights provides a biologically plausible mechanism of achieving good performance, without the need for precise, pre-specified learning rules.

Code: https://github.com/benlansdell/synthfeedback

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/learning-to-solve-the-credit-assignment/code)

Original Pdf: pdf

10 Replies

Loading