Keywords: conditional independence, statistical testing, kernel methods
Abstract: Tests of conditional independence (CI) underpin a number of important problems in machine learning and statistics, from causal discovery to the evaluation of predictor fairness and out-of-distribution robustness. Shah and Peters (2020) showed that, contrary to the unconditional case, no test that is valid in finite samples for all null distributions can achieve nontrivial power. While informative, this result (based on “hiding” dependence) does not seem to explain the frequent practical failures observed with popular CI tests. We investigate the Kernel-based Conditional Independence (KCI) test, show that the Generalized Covariance Measure underlying many recent tests is _nearly_ a special case of it, and identify the major factors driving its practical behavior. We highlight the key role that errors in the conditional mean embedding estimate play in the Type I error, and we show that selecting an appropriate conditioning kernel, an issue not recognized in previous work, is necessary for good test power but also tends to inflate the Type I error.
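For intuition, here is a minimal NumPy sketch of a KCI-style statistic as described above: the centered kernel matrices of X and Y are residualized on Z via kernel ridge regression (an estimate of the conditional mean embedding), and an HSIC-like trace statistic is computed from the residual kernels. The fixed Gaussian bandwidths, the regularizer `reg`, and the function names are illustrative assumptions rather than the authors' implementation, and the calibration step (KCI's gamma approximation to the null distribution) is omitted.

```python
import numpy as np

def rbf_kernel(a, bandwidth=1.0):
    """Gaussian RBF kernel matrix for the rows of a (shape (n, d))."""
    sq = np.sum(a ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * a @ a.T
    return np.exp(-d2 / (2.0 * bandwidth ** 2))

def center(k):
    """Doubly center a kernel matrix: H K H with H = I - (1/n) 11^T."""
    n = k.shape[0]
    h = np.eye(n) - np.ones((n, n)) / n
    return h @ k @ h

def kci_statistic(x, y, z, reg=1e-3):
    """Simplified KCI-style statistic for H0: X independent of Y given Z.

    Note: the original KCI test augments X with Z before computing its
    kernel; this sketch omits that step for brevity.
    """
    n = x.shape[0]
    kx, ky, kz = (center(rbf_kernel(v)) for v in (x, y, z))
    # Residualizing matrix R = I - Kz (Kz + n*reg*I)^{-1} from kernel ridge
    # regression on Z; errors in this regression step are the
    # conditional-mean-embedding errors discussed in the abstract.
    r = np.eye(n) - kz @ np.linalg.inv(kz + n * reg * np.eye(n))
    kx_res = r @ kx @ r.T
    ky_res = r @ ky @ r.T
    # HSIC-like normalized trace of the product of residual kernels.
    return np.trace(kx_res @ ky_res) / n
```

On synthetic data where the null holds, the statistic should concentrate near zero; the full test would then compare it against a gamma (or weighted chi-square) approximation of the null distribution to obtain a p-value:

```python
rng = np.random.default_rng(0)
z = rng.normal(size=(300, 1))
x = z + 0.3 * rng.normal(size=(300, 1))  # X and Y related only through Z,
y = z + 0.3 * rng.normal(size=(300, 1))  # so X _||_ Y | Z holds here
print(kci_statistic(x, y, z))
```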
Primary Area: General machine learning (supervised, unsupervised, online, active, etc.)
Submission Number: 26249