Exploring The Loss Landscape Of Regularized Neural Networks Via Convex Duality

Sungyoon Kim; Aaron Mishkin; Mert Pilanci

Published: 22 Jan 2025, Last Modified: 29 Apr 2025. Venue: ICLR 2025 Oral. License: CC BY 4.0

Keywords: Convex duality, Machine Learning Theory, Loss Landscape, Optimal Sets

TL;DR: We investigate the loss landscape and topology of the optimal set of neural networks using convex duality.

Abstract: We discuss several aspects of the loss landscape of regularized neural networks: the structure of stationary points, the connectivity of optimal solutions, the existence of paths with non-increasing loss to an arbitrary global optimum, and the nonuniqueness of optimal solutions. We do so by casting the training problem as an equivalent convex problem and considering its dual. Starting from two-layer neural networks with scalar output, we first characterize the solution set of the convex problem using its dual and then characterize all stationary points. Using this characterization, we show that the topology of the set of global optima undergoes a phase transition as the width of the network changes, and we construct counterexamples where the problem may have a continuum of optimal solutions. Finally, we show that the solution set characterization and connectivity results extend to other architectures, including two-layer vector-valued neural networks and parallel three-layer neural networks.

Supplementary Material: zip

Primary Area: learning theory

Submission Number: 9266
