RealCause: Realistic Causal Inference Benchmarking

Brady Neal; Chin-Wei Huang; Sunand Raghupathi

RealCause: Realistic Causal Inference Benchmarking

Brady Neal, Chin-Wei Huang, Sunand Raghupathi

07 Jun 2021 (modified: 26 May 2025)Submitted to NeurIPS 2021 Datasets and Benchmarks Track (Round 1)Readers: Everyone

Keywords: Causa inference, benchmarking

TL;DR: We create a benchmarking framework that allows you to create realistic benchmarks.

Abstract: There are many different causal effect estimators in causal inference. However, it is unclear how to choose between these estimators because there is no ground-truth for causal effects. A commonly used option is to simulate synthetic data, where the ground-truth is known. However, the best causal estimators on synthetic data are unlikely to be the best causal estimators on real data. An ideal benchmark for causal estimators would both (a) yield ground-truth values of the causal effects and (b) be representative of real data. Using flexible generative models, we provide a benchmark that both yields ground-truth and is realistic. Using this benchmark, we evaluate over 1500 different causal estimators and provide evidence that it is rational to choose hyperparameters for causal estimators using predictive metrics.

Supplementary Material: zip

URL: https://github.com/bradyneal/realcause

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/realcause-realistic-causal-inference/code)

7 Replies

Loading