Dichotomize and Generalize: PAC-Bayesian Binary Activated Deep Neural NetworksDownload PDF

Gaël Letarte, Pascal Germain, Benjamin Guedj, Francois Laviolette

06 Sept 2019 (modified: 05 May 2023)NeurIPS 2019Readers: Everyone
Abstract: We present a comprehensive study of multilayer neural networks with binary activation, relying on the PAC-Bayesian theory. Our contributions are twofold: (i) we develop an end-to-end framework to train a binary activated deep neural network, overcoming the fact that binary activation function is non-differentiable. (ii) we provide nonvacuous PAC-Bayesian generalization bounds for binary activated deep neural networks. Noteworthy, our results are obtained by minimizing the expected loss of an architecture-dependent aggregation of binary activated deep neural networks. The performance of our approach is assessed on a thorough numerical experiments protocol on real-life datasets.
Code Link: https://github.com/gletarte/dichotomize-and-generalize
CMT Num: 3731
0 Replies

Loading