Deep Learning is Singular, and That's Good

Daniel Murfet; Susan Wei; Mingming Gong; Hui Li; Jesse Gell-Redman; Thomas Quella

Deep Learning is Singular, and That's Good

Daniel Murfet, Susan Wei, Mingming Gong, Hui Li, Jesse Gell-Redman, Thomas Quella

28 Sept 2020 (modified: 22 Jun 2025)ICLR 2021 Conference Blind SubmissionReaders: Everyone

Keywords: deep learning theory, effective degrees of freedom, generalisation, posterior predictive distribution, real log canonical threshold, singular learning theory

Abstract: In singular models, the optimal set of parameters forms an analytic set with singularities and classical statistical inference cannot be applied to such models. This is significant for deep learning as neural networks are singular and thus ``dividing" by the determinant of the Hessian or employing the Laplace approximation are not appropriate. Despite its potential for addressing fundamental issues in deep learning, singular learning theory appears to have made little inroads into the developing canon of deep learning theory. Via a mix of theory and experiment, we present an invitation to singular learning theory as a vehicle for understanding deep learning and suggest important future work to make singular learning theory directly applicable to how deep learning is performed in practice.

One-sentence Summary: An invitation to singular learning theory as a theory of deep learning

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics

Supplementary Material: zip

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 2 code implementations](https://www.catalyzex.com/paper/deep-learning-is-singular-and-that-s-good/code)

Reviewed Version (pdf): https://openreview.net/references/pdf?id=0L9nqwO3PE

9 Replies

Loading