Learning Curve Prediction with Bayesian Neural Networks

Aaron Klein; Stefan Falkner; Jost Tobias Springenberg; Frank Hutter

Learning Curve Prediction with Bayesian Neural Networks

Aaron Klein, Stefan Falkner, Jost Tobias Springenberg, Frank Hutter

Published: 06 Feb 2017, Last Modified: 05 May 2023ICLR 2017 PosterReaders: Everyone

Abstract: Different neural network architectures, hyperparameters and training protocols lead to different performances as a function of time. Human experts routinely inspect the resulting learning curves to quickly terminate runs with poor hyperparameter settings and thereby considerably speed up manual hyperparameter optimization. Exploiting the same information in automatic Bayesian hyperparameter optimization requires a probabilistic model of learning curves across hyperparameter settings. Here, we study the use of Bayesian neural networks for this purpose and improve their performance by a specialized learning curve layer.

TL;DR: We present a general probabilistic method based on Bayesian neural networks to predit learning curves of iterative machine learning methods.

Conflicts: ubc.ca, uni-freiburg.de

Keywords: Deep learning, Applications

13 Replies

Loading