Nonparametric Regression with Shallow Overparameterized Neural Networks Trained by GD with Early Stopping

Abstract: We explore the ability of overparameterized shallow neural networks to learn Lipschitz regression functions with and without label noise when trained by Gradient Descent (GD). To avoid the problem ...
0 Replies
Loading