SGD Finds then Tunes Features in Two-Layer Neural Networks with near-Optimal Sample Complexity: A Case Study in the XOR problem

Published: 01 Jan 2024, Last Modified: 28 Apr 2025ICLR 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading