A Transductive Forest for Anomaly Detection with Few Labels

Jingrui Zhang, Ninh Pham, Gillian Dobbie

Published: 01 Jan 2023, Last Modified: 25 May 2024, Venue: ECML/PKDD (1) 2023, License: CC BY-SA 4.0

Abstract: Extensive labeled training data for anomaly detection is enormously expensive and often unavailable in data-sensitive applications due to privacy constraints. We propose TransForest, a transductive forest for anomaly detection in the semi-supervised setting where only a few labels are available. Guided by this limited label information, TransForest pushes classification boundaries toward sensitive areas where abnormal and normal points are located, increasing learning capacity. Empirically, TransForest is competitive with representative unsupervised and semi-supervised detectors when given a small number of labeled points. TransForest also offers a feature importance ranking consistent with the rankings provided by popular supervised forests on low-dimensional data sets. Our code is available at https://github.com/jzha968/transForest.