The Benefits of Self-Supervised Learning for Training Physical Neural Networks

Published: 01 Nov 2023, Last Modified: 22 Dec 2023, MLNCP Poster
Keywords: Physical Neural Networks, Self-supervised learning, Noise robustness, Sparsity, Pruning, Greedy layer-wise training
TL;DR: We investigate the use of self-supervised learning (SSL) for training Physical Neural Networks, show that robustness to noise and to pruning emerges, and explore greedy layer-wise training with intermediate SSL objectives.
Abstract: Physical Neural Networks (PNNs) are energy-efficient alternatives to their digital counterparts. Because they are inherently variable, noisy and hardly differentiable, PNNs require tailored training methods. Additionally, while the properties of PNNs make them good candidates for edge computing, where memory and computational resources are constrained, most training algorithms developed for PNNs focus on supervised learning, even though labeled data may not be accessible on the edge. Here, we propose Self-Supervised Learning (SSL) as an ideal framework for training PNNs (we focus here on computer vision tasks): 1. SSL eliminates the reliance on labeled data altogether, and 2. since SSL forces the network to extract high-level concepts, networks trained with SSL should be highly robust to noise and device variability. We show with simulations that the latter properties indeed emerge when a network is trained on MNIST in the SSL setting, whereas they do not under supervised training. We also show empirically that optimizing layer-wise SSL objectives, rather than a single global one, matches the performance of global optimization on MNIST and CIFAR-10. This could enable local learning without any backpropagation, especially in the scheme we propose with stochastic optimization. We expect this preliminary work, based on simulations, to pave the way for a robust paradigm for training PNNs and hope to stimulate interest in the unconventional computing community and beyond.
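To make the greedy layer-wise scheme concrete, below is a minimal sketch of training each layer against its own intermediate SSL objective while earlier layers stay frozen. Everything here is an illustrative assumption rather than the authors' exact method: the two-block MLP, the Gaussian-noise augmentation, and the Barlow Twins-style redundancy-reduction loss are stand-ins, and each local objective is optimized with gradients for simplicity, whereas the abstract suggests stochastic (gradient-free) optimization of each local objective to avoid backpropagation on hardware entirely.

```python
# Hypothetical sketch of greedy layer-wise training with local SSL
# objectives. Architecture, augmentation, and loss are assumptions,
# not the method from the paper.
import torch
import torch.nn as nn

def barlow_twins_loss(z1, z2, lambd=5e-3):
    # Barlow Twins-style objective: align the two views' embeddings
    # (diagonal terms) while decorrelating embedding dimensions
    # (off-diagonal terms).
    n = z1.shape[0]
    z1 = (z1 - z1.mean(0)) / (z1.std(0) + 1e-6)
    z2 = (z2 - z2.mean(0)) / (z2.std(0) + 1e-6)
    c = (z1.T @ z2) / n                       # cross-correlation matrix
    on_diag = (torch.diagonal(c) - 1).pow(2).sum()
    off_diag = (c - torch.diag(torch.diagonal(c))).pow(2).sum()
    return on_diag + lambd * off_diag

def augment(x):
    # Toy augmentation: additive Gaussian noise, a stand-in for the
    # usual image augmentations such as crops and flips.
    return x + 0.1 * torch.randn_like(x)

# Hypothetical two-block network; each block is trained greedily.
layers = nn.ModuleList([
    nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 256), nn.ReLU()),
    nn.Sequential(nn.Linear(256, 256), nn.ReLU()),
])

x = torch.randn(128, 1, 28, 28)  # stand-in for a batch of MNIST images

frozen = nn.Identity()
for layer in layers:
    opt = torch.optim.Adam(layer.parameters(), lr=1e-3)
    for step in range(100):
        with torch.no_grad():                # earlier layers are frozen
            h1, h2 = frozen(augment(x)), frozen(augment(x))
        loss = barlow_twins_loss(layer(h1), layer(h2))
        opt.zero_grad()
        loss.backward()
        opt.step()
    # Absorb the trained block into the frozen stack before moving on.
    frozen = nn.Sequential(frozen, layer).requires_grad_(False)
```

Because each block is trained against its own local loss, no gradient ever crosses a block boundary, which is what would make such a scheme compatible with hardware where end-to-end backpropagation is impractical.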
Submission Number: 24