Learning Curve Prediction with Bayesian Neural Networks

Aaron Klein, Stefan Falkner, Jost Tobias Springenberg, Frank Hutter

Nov 04, 2016 (modified: Mar 03, 2017) ICLR 2017 conference submission readers: everyone
  • Abstract: Different neural network architectures, hyperparameters and training protocols lead to different performances as a function of time. Human experts routinely inspect the resulting learning curves to quickly terminate runs with poor hyperparameter settings and thereby considerably speed up manual hyperparameter optimization. Exploiting the same information in automatic Bayesian hyperparameter optimization requires a probabilistic model of learning curves across hyperparameter settings. Here, we study the use of Bayesian neural networks for this purpose and improve their performance by a specialized learning curve layer.
  • TL;DR: We present a general probabilistic method based on Bayesian neural networks to predit learning curves of iterative machine learning methods.
  • Conflicts: ubc.ca, uni-freiburg.de
  • Keywords: Deep learning, Applications