Equivariant Neural Functional Networks for Transformers

Published: 05 Mar 2025, Last Modified: 05 Mar 2025
Venue: ICLR 2025 Workshop Weight Space Learning (Spotlight)
License: CC BY 4.0
Track: long paper (up to 8 pages)
Keywords: neural functional network, transformer, maximal symmetric group, equivariant model, dataset
TL;DR: This paper systematically studies neural functional networks (NFNs) for Transformers, presenting a design principle and an equivariant NFN called Transformer-NFN, along with a benchmark dataset for evaluation.
Submission Number: 27