Permutation-based Rank Test in the Presence of Discretization and Application in Causal Discovery with Mixed Data
Keywords: Rank Test; Discretization; Causal Discovery
Abstract: Recent advances have shown that statistical tests for the rank of cross-covariance matrices play an important role in causal discovery. These rank tests include partial correlation tests as special cases and provide further graphical information about latent variables.
Existing rank tests typically assume that all the continuous variables can be perfectly measured,
and yet, in practice many variables can only be measured after discretization.
For example, in psychometric studies,
the continuous level of certain personality dimensions of a person can only be measured after being discretized into order-preserving options such as disagree, neutral, and agree.
Motivated by this, we
propose Mixed data Permutation-based Rank Test (MPRT), which properly controls the statistical errors even when some or all variables are discretized.
Theoretically, we establish the
exchangeability and
estimate the asymptotic null distribution by permutations;
as a consequence,
MPRT can effectively control the Type I error in the presence of discretization while previous methods cannot.
Empirically, our method is validated by extensive experiments on synthetic data
and real-world data to demonstrate its effectiveness as well as applicability in causal discovery.
Primary Area: causal reasoning
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics.
Submission Guidelines: I certify that this submission complies with the submission instructions as described on https://iclr.cc/Conferences/2025/AuthorGuide.
Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors’ identity.
No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.
Submission Number: 6282
Loading