Towards Unbiased Evaluation of Time-series Anomaly Detector

Published: 10 Oct 2024, Last Modified: 26 Nov 2024NeurIPS 2024 TSALM WorkshopEveryoneRevisionsBibTeXCC BY 4.0
Keywords: time series, anomaly detection, point adjustment, F1 score
TL;DR: This work proposes an adjustment protocol for time-series anomaly detection (TSAD) called ``Balanced point adjustment'' (BA). It addresses the limitations of existing point adjustments and provides fairness guarantees.
Abstract: Time series anomaly detection (TSAD) is an evolving area of research motivated by its critical applications, such as detecting seismic activity, sensor failures in industrial plants, predicting crashes in the stock market, and so on. Anomalies are rare events, making the F1-score the most commonly adopted metric for anomaly detection. However, in time series the challenge of using standard F1-score is the dissociation between time points and time events. To accommodate this, anomaly predictions are adjusted, called point adjustment (PA), before the $F_1$-score evaluation. However, these adjustments are heuristics-based, and biased towards true positive detection, resulting in over-estimated detector performance. However, the current time-series foundation model literature continues to use PA for model evaluation. Such obtained model perspectives are not a true indication of the performance. This work proposes an alternative adjustment protocol called ``Balanced point adjustment'' (BA). It addresses the limitations of existing point adjustments and provides fairness guarantees backed by axiomatic definitions of TSAD evaluation.
Submission Number: 86
Loading