Learning symmetries via weight-sharing with doubly stochastic tensors

Published: 17 Jun 2024, Last Modified: 12 Jul 2024 · ICML 2024 Workshop GRaM · CC BY 4.0
Track: Extended abstract
Keywords: weight-sharing, equivariance, group convolutions
TL;DR: We propose a method for weight-sharing in neural networks via soft permutations
Abstract: Traditional group equivariant methods presuppose known groups, an assumption that can be unrealistic for real-world datasets and potentially too restrictive for neural network architectures. Typically, equivariance in neural networks is implemented through group transformations applied to a canonical weight tensor, facilitating weight sharing across a specified group G. In this study, we introduce a method to learn such weight-sharing schemes. Our approach involves developing a set of learnable, doubly stochastic matrices that function as soft permutation matrices on canonical weight tensors, accommodating regular group representations as a specific instance. This allows for adaptive kernel transformations that are optimized in conjunction with downstream tasks. Our results demonstrate that when datasets display pronounced symmetries, the learned permutation matrices approximate regular group representations, effectively transforming our weight-sharing networks into standard group convolutional networks.
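To make the idea concrete, here is a minimal sketch of the core mechanism the abstract describes: a set of learnable matrices, pushed toward the doubly stochastic set, applied as soft permutations to a single canonical weight tensor. The abstract does not specify the parameterization, so this sketch assumes Sinkhorn normalization (a standard way to produce approximately doubly stochastic matrices); the class and function names (`sinkhorn`, `SoftPermWeightSharing`) are hypothetical, not from the paper.

```python
# Hypothetical sketch: weight sharing via learned soft permutations.
# Assumes Sinkhorn normalization as the doubly stochastic parameterization;
# the paper's exact construction may differ.
import torch
import torch.nn as nn


def sinkhorn(logits: torch.Tensor, n_iters: int = 10) -> torch.Tensor:
    """Map square matrices of logits to (approximately) doubly stochastic
    matrices by alternating row and column normalization in log space."""
    log_p = logits
    for _ in range(n_iters):
        log_p = log_p - torch.logsumexp(log_p, dim=-1, keepdim=True)  # rows sum to 1
        log_p = log_p - torch.logsumexp(log_p, dim=-2, keepdim=True)  # cols sum to 1
    return log_p.exp()


class SoftPermWeightSharing(nn.Module):
    """Produces |G| kernels by applying learned soft permutations to one
    canonical kernel; a hard permutation stack would recover the regular
    representation of a group, as in standard group convolutions."""

    def __init__(self, group_size: int, kernel_dim: int):
        super().__init__()
        self.canonical = nn.Parameter(torch.randn(kernel_dim))
        # One learnable matrix per "group element"; Sinkhorn makes each
        # approximately doubly stochastic, i.e. a soft permutation.
        self.logits = nn.Parameter(torch.randn(group_size, kernel_dim, kernel_dim))

    def forward(self) -> torch.Tensor:
        perms = sinkhorn(self.logits)  # (|G|, d, d), approx. doubly stochastic
        return torch.einsum("gij,j->gi", perms, self.canonical)  # (|G|, d)


# Example: four soft-permuted copies of a flattened 3x3 kernel.
weights = SoftPermWeightSharing(group_size=4, kernel_dim=9)()
print(weights.shape)  # torch.Size([4, 9])
```

Because the permutation matrices are learned jointly with the task loss, the network can discover which transformations of the canonical kernel are useful; when the data is strongly symmetric, the abstract reports that these matrices approach true permutations, recovering a group convolutional network.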
Submission Number: 38