Double Machine Learning Based Structure Identification from Temporal Data

TMLR Paper5042 Authors

05 Jun 2025 (modified: 14 Jul 2025)Under review for TMLREveryoneRevisionsBibTeXCC BY 4.0
Abstract: Learning the causes of time-series data is a fundamental task in many applications, spanning from finance to earth sciences or bio-medical applications. Common approaches for this task are based on vector auto-regression, and they do not take into account unknown confounding between potential causes. However, in settings with many potential causes and noisy data, these approaches may be substantially biased. Furthermore, potential causes may be correlated in practical applications or even contain cycles. To address these challenges, we propose a new double machine learning based method for structure identification from temporal data (DR-SIT). We provide theoretical guarantees, showing that our method asymptotically recovers the true underlying causal structure. Our analysis extends to cases where the potential causes have cycles, and they may even be confounded. We further perform extensive experiments to showcase the superior performance of our method. Code: https://anonymous.4open.science/r/TMLR_submission_DR_SIT-6B46/
Submission Length: Regular submission (no more than 12 pages of main content)
Assigned Action Editor: ~Adam_Arany1
Submission Number: 5042
Loading