Model-based Asynchronous Hyperparameter and Neural Architecture Search

Aaron Klein; Louis Chi-Chun Tiao; Thibaut Lienart; Cedric Archambeau; Matthias Seeger

Model-based Asynchronous Hyperparameter and Neural Architecture Search

Aaron Klein, Louis Chi-Chun Tiao, Thibaut Lienart, Cedric Archambeau, Matthias Seeger

28 Sept 2020 (modified: 05 May 2023)ICLR 2021 Conference Blind SubmissionReaders: Everyone

Keywords: Bayesian Optimization, AutoML, Hyperparameter Optimization, Neural Architecture Search

Abstract: We introduce a model-based asynchronous multi-fidelity method for hyperparameter and neural architecture search that combines the strengths of asynchronous Successive Halving and Gaussian process-based Bayesian optimization. At the heart of our method is a probabilistic model that can simultaneously reason across hyperparameters and resource levels, and supports decision-making in the presence of pending evaluations. We demonstrate the effectiveness of our method on a wide range of challenging benchmarks, for tabular data, image classification and language modelling, and report substantial speed-ups over current state-of-the-art methods. Our new methods, along with asynchronous baselines, are implemented in a distributed framework which will be open sourced along with this publication.

One-sentence Summary: We present a new, asynchronous multi-fidelty Bayesian optimization method to efficiently search for hyperparameters and architectures of neural networks.

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics

Supplementary Material: zip

Reviewed Version (pdf): https://openreview.net/references/pdf?id=NxcbViotEH

15 Replies

Loading