Implicit Bias of the Step Size in Linear Diagonal Neural Networks

2022 (modified: 25 Apr 2023), ICML 2022, Readers: Everyone
Abstract: Focusing on diagonal linear networks as a model for understanding the implicit bias in underdetermined models, we show how the gradient descent step size can have a large qualitative effect on the ...