A Neural Tangent Kernel Perspective on Function-Space Regularization in Neural Networks

Zonghao Chen; Xupeng Shi; Tim G. J. Rudner; Qixuan Feng; WEIZHONG ZHANG; Tong Zhang

A Neural Tangent Kernel Perspective on Function-Space Regularization in Neural Networks

Zonghao Chen, Xupeng Shi, Tim G. J. Rudner, Qixuan Feng, WEIZHONG ZHANG, Tong Zhang

Published: 23 Nov 2022, Last Modified: 05 May 2023OPT 2022 PosterReaders: Everyone

Keywords: Function-Space Regularization, Neural Tangent Kernel, Generalization, Continual Learning

TL;DR: This paper proposes a function-space regularization technique for improving generalization in over-parameterized neural networks.

Abstract: Regularization can help reduce the gap between training and test error by systematically limiting model complexity. Popular regularization techniques such as L2 weight regularization act directly on the network parameters but do not explicitly take into account how the interplay between the parameters and the network architecture may affect the induced predictive functions. To address this shortcoming, we propose a simple technique for effective function-space regularization. Drawing on the result that fully-trained wide multi-layer perceptrons are equivalent to kernel regression under the Neural Tangent Kernel (NTK), we propose to approximate the norm of neural network functions by the reproducing kernel Hilbert space norm under the NTK and use it as a function-space regularizer. We prove that neural networks trained using this regularizer are arbitrarily close to kernel ridge regression solutions under the NTK. Furthermore, we provide a generalization error bound under the proposed regularizer and empirically demonstrate improved generalization and state-of-the-art performance on downstream tasks where effective regularization on the induced space of functions is essential.

0 Replies

Loading