Neural Networks with Complex-Valued Weights Have No Spurious Local Minima

Published: 10 Oct 2024, Last Modified: 07 Dec 2024, NeurIPS 2024 Workshop, CC BY 4.0
Keywords: Neural Networks, Optimization Landscape, Nonconvex Optimization
Abstract: We study the benefits of complex-valued weights for neural networks. We prove that shallow complex neural networks with quadratic activations have no spurious local minima. In contrast, shallow real neural networks with quadratic activations have infinitely many spurious local minima under the same conditions. In addition, we provide specific examples to demonstrate that complex-valued weights turn poor local minima into saddle points.
Submission Number: 117