On the Fly Detection of Root Causes from Observed Data with Application to IT SystemsDownload PDFOpen Website

Published: 01 Jan 2024, Last Modified: 21 Feb 2024CoRR 2024Readers: Everyone
Abstract: This paper introduces a new structural causal model tailored for representing threshold-based IT systems and presents a new algorithm designed to rapidly detect root causes of anomalies in such systems. When root causes are not causally related, the method is proven to be correct; while an extension is proposed based on the intervention of an agent to relax this assumption. Our algorithm and its agent-based extension leverage causal discovery from offline data and engage in subgraph traversal when encountering new anomalies in online data. Our extensive experiments demonstrate the superior performance of our methods, even when applied to data generated from alternative structural causal models or real IT monitoring data.
0 Replies

Loading