Keywords: convolutional networks, feature extraction, deep learning theory
TL;DR: This paper introduces a novel mathematical framework for image classification, theoretically analyzing how convolutional neural networks extract features from images.
Abstract: Over the past decade deep learning has revolutionized the field of computer vision, with convolutional neural network models proving to be very effective for image classification benchmarks. Given their widespread adoption, several theoretical works have analyzed their expressiveness, and study the class of piecewise linear functions that they can realize. However, a fundamental theoretical questions remain answered: why are piecewise linear functions effective for feature extraction tasks that arise in image classification? We address this question in this paper by introducing a simplified mathematical model for feature extraction, based on classical template matching algorithms that are commonly used in computer vision. We then prove that convolutional neural network classifiers can solve this class of image classification problems, by constructing piecewise linear functions that detect the presence of features, and showing that they can be realized by convolutional neurons. We also discuss the interpretability of the networks we construct, and compare them with those obtained via gradient-based optimization methods by conducting experiments on simple datasets.
Supplementary Material: zip
Primary Area: learning theory
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics.
Submission Guidelines: I certify that this submission complies with the submission instructions as described on https://iclr.cc/Conferences/2025/AuthorGuide.
Reciprocal Reviewing: I understand the reciprocal reviewing requirement as described on https://iclr.cc/Conferences/2025/CallForPapers. If none of the authors are registered as a reviewer, it may result in a desk rejection at the discretion of the program chairs. To request an exception, please complete this form at https://forms.gle/Huojr6VjkFxiQsUp6.
Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors’ identity.
No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.
Submission Number: 8698
Loading