HPO: We won’t get fooled again

René Traoré; Andrés Camero; Xiao Xiang Zhu

HPO: We won’t get fooled again

René Traoré, Andrés Camero, Xiao Xiang Zhu

Published: 16 May 2022, Last Modified: 05 May 2023AutoML 2022 (Late-Breaking Workshop)Readers: Everyone

Abstract: Hyperparameter optimization (HPO) is a well-studied research field. However, the effects and interactions of the components in an HPO pipeline are not yet well investigated. Then, we ask ourselves: Can the landscape of HPO be biased by the pipeline used to evaluate individual configurations? To address this question, we proposed to analyze the effect of the HPO pipeline on HPO problems using fitness landscape analysis. Particularly, we studied the DS-2019 HPO benchmark data set, looking for patterns that could indicate evaluation pipeline malfunction, and relate them to HPO performance. Our main findings are: (i) In most instances, large groups of diverse hyperparameters (i.e., multiple configurations) yield the same \emph{ill} performance, most likely associated with majority class prediction models; (ii) in these cases, a worsened correlation between the observed fitness and average fitness in the neighborhood is observed, potentially making harder the deployment of local-search-based HPO strategies. Finally, we concluded that the HPO pipeline definition might negatively affect the HPO landscape.

Keywords: AutoML, Hyperparameter Optimization, Landscape Analysis

One-sentence Summary: In this paper, we analyze Hyperparameter Optimization (HPO) problems using fitness landscape analysis, and look for patterns that could indicate evaluation pipeline malfunction, and relate them to HPO performance.

Reproducibility Checklist: Yes

Broader Impact Statement: Yes

Paper Availability And License: Yes

Code Of Conduct: Yes

Reviewers: René Traoré

Code And Dataset Supplement: https://github.com/anonymous-for-open-review/late-breaking-automlConf-2022

Main Paper And Supplementary Material: pdf

1 Reply

Loading