Published: 29 Aug 2025, Last Modified: 04 Sept 2025AutoML 2025 Non-Archival Content TrackEveryoneRevisionsBibTeXCC BY 4.0
Submission Type:Short paper
Tldr:We generalize NASWOT to support diverse activation functions for training-free performance prediction, but surprisingly find that ReLU alone retains most of the relevant information across NLP benchmarks.