Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset

Alexandre Galashov; Michalis Titsias; András György; Clare Lyle; Razvan Pascanu; Yee Whye Teh; Maneesh Sahani

Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset

Alexandre Galashov, Michalis Titsias, András György, Clare Lyle, Razvan Pascanu, Yee Whye Teh, Maneesh Sahani

Published: 25 Sept 2024, Last Modified: 14 Jan 2025NeurIPS 2024 posterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Non-stationarity, plasticity loss, online learning, deep learning

TL;DR: Learning to reset Neural Networks parameters using Ornstein-Uhlenbeck to learn on non-stationary data

Abstract: Neural networks are most often trained under the assumption that data come from a stationary distribution. However, settings in which this assumption is violated are of increasing importance; examples include supervised learning with distributional shifts, reinforcement learning, continual learning and non-stationary contextual bandits. Here, we introduce a novel learning approach that automatically models and adapts to non-stationarity by linking parameters through an Ornstein-Uhlenbeck process with an adaptive drift parameter. The adaptive drift draws the parameters towards the distribution used at initialisation, so the approach can be understood as a form of soft parameter reset. We show empirically that our approach performs well in non-stationary supervised, and off-policy reinforcement learning settings.

Primary Area: Optimization for deep networks

Submission Number: 19536

Loading