HPOD: Hyperparameter Optimization for Unsupervised Outlier Detection

Published: 30 Apr 2024 · Last Modified: 05 Sep 2024 · AutoML 2024 · CC BY 4.0
Keywords: outlier detection, hyperparameter optimization, automated machine learning
TL;DR: A meta-learning framework to optimize hyperparameters for unsupervised outlier detection
Abstract: Given an unsupervised outlier detection (OD) algorithm, how can we optimize its hyperparameter(s) (HP) on a new dataset without using any labels? In this work, we address this challenging problem of hyperparameter optimization for unsupervised OD and propose the first continuous HP search method, called HPOD. It capitalizes on the prior performance of a large collection of HPs on existing OD benchmark datasets, and transfers this information to enable HP evaluation on a new dataset without labels. HPOD also adapts a prominent, originally supervised, sampling paradigm to efficiently identify promising HPs over iterations. Extensive experiments show that HPOD works with both deep (e.g., Robust AutoEncoder (RAE)) and shallow (e.g., Local Outlier Factor (LOF) and Isolation Forest (iForest)) algorithms, on both discrete and continuous HP spaces. HPOD outperforms a wide range of diverse baselines, with 37% average improvement over the minimal-loss HPs of RAE, and 58% and 66% average improvement over the default HPs of LOF and iForest, respectively.
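The abstract describes a two-part recipe: (1) learn, from labeled historical OD benchmarks, how HP configurations relate to detection performance, so that HPs can be scored on a new dataset without labels, and (2) sample promising HPs sequentially rather than exhaustively. The Python sketch below illustrates that loop under loud assumptions: the meta-features, the random-forest surrogate, the synthetic benchmark performances, and the epsilon-greedy sampling rule are all illustrative stand-ins, not HPOD's actual components (see the paper and the linked repository for those).

```python
# Illustrative sketch of the meta-learned HP-evaluation idea, NOT the
# authors' implementation. Everything below (meta_features, the surrogate
# choice, the synthetic performance curve, epsilon-greedy sampling) is a
# simplified stand-in for HPOD's real design.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

def meta_features(X):
    """Cheap, label-free dataset descriptors (stand-in for real meta-features)."""
    return np.array([X.shape[0], X.shape[1], X.mean(), X.std()])

# --- Meta-training: historical benchmarks with known performance per HP ---
# hp_grid: candidate values of a single HP (e.g., LOF's n_neighbors).
hp_grid = np.arange(5, 105, 5, dtype=float)

train_rows, train_perf = [], []
for _ in range(20):  # 20 synthetic "benchmark" datasets
    X = rng.normal(size=(rng.integers(200, 500), rng.integers(5, 20)))
    phi = meta_features(X)
    best_hp = rng.choice(hp_grid)  # pretend this HP was best historically
    for hp in hp_grid:
        # Synthetic ground-truth performance, peaked at best_hp.
        perf = np.exp(-((hp - best_hp) / 30.0) ** 2) + rng.normal(0, 0.02)
        train_rows.append(np.concatenate([phi, [hp]]))
        train_perf.append(perf)

# Surrogate mapping (meta-features, HP) -> expected detection performance;
# this is what lets HPs be "evaluated" on a new dataset without labels.
surrogate = RandomForestRegressor(n_estimators=100, random_state=0)
surrogate.fit(np.array(train_rows), np.array(train_perf))

# --- HP search on a new, unlabeled dataset ---
X_new = rng.normal(size=(300, 10))
phi_new = meta_features(X_new)

evaluated, evaluated_scores = [], []
candidates = list(hp_grid)
for _ in range(10):  # 10 sequential sampling iterations
    preds = surrogate.predict(
        np.array([np.concatenate([phi_new, [hp]]) for hp in candidates]))
    if rng.random() < 0.2:                 # explore a random candidate
        idx = int(rng.integers(len(candidates)))
    else:                                  # exploit the surrogate's estimate
        idx = int(np.argmax(preds))
    evaluated.append(candidates.pop(idx))
    evaluated_scores.append(preds[idx])

best_hp_new = evaluated[int(np.argmax(evaluated_scores))]
print(f"selected HP for the new dataset: {best_hp_new}")
```

In this toy version, the surrogate plays the role of the label-free performance evaluator and the epsilon-greedy loop plays the role of the adapted sampling paradigm; the paper's actual evaluator, meta-features, and sampler differ.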
Submission Checklist: Yes
Broader Impact Statement: Yes
Paper Availability And License: Yes
Code Of Conduct: Yes
Code And Dataset Supplement: zip
Community Implementations: https://github.com/review2024/HPOD
Submission Number: 8