Keywords: modular addition, cosets, mechanistic interpretability, manifold hypothesis, universality hypothesis
Abstract: We find coset and approximate coset circuits play a key role in how multilayer perceptrons learn dihedral group multiplication, consistent with recent findings on modular addition.
Submission Number: 115
Loading