Gradient descent with identity initialization efficiently learns positive definite linear transformationsDownload PDFOpen Website

2018 (modified: 11 Nov 2022)ICML 2018Readers: Everyone
Abstract: We analyze algorithms for approximating a function $f(x) = \Phi x$ mapping $\Re^d$ to $\Re^d$ using deep linear neural networks, i.e. that learn a function $h$ parameterized by matrices $\Theta_1,....
0 Replies

Loading