The Representations of Deep Neural Networks Trained on Dihedral Group Multiplication

Published: 22 Sept 2025, Last Modified: 22 Sept 2025WiML @ NeurIPS 2025EveryoneRevisionsBibTeXCC BY 4.0
Keywords: universality hypothesis, cosets, manifold hypothesis, interpretability, modular addition
Abstract: We find coset and approximate coset circuits play a key role in how multilayer perceptrons learn dihedral group multiplication, consistent with recent findings on modular addition. We identify that neural preactivations concentrate on (approximate) cosets and visualize the manifolds distributed across neurons that correspond to the (approximate) coset representations.
Submission Number: 342
Loading