MIRACLE: Causally-Aware Imputation via Learning Missing Data Mechanisms

Trent Kyono; Yao Zhang; Alexis Bellot; Mihaela van der Schaar

MIRACLE: Causally-Aware Imputation via Learning Missing Data Mechanisms

Trent Kyono, Yao Zhang, Alexis Bellot, Mihaela van der Schaar

Published: 09 Nov 2021, Last Modified: 26 May 2025NeurIPS 2021 PosterReaders: Everyone

Keywords: Missing data, Multiple imputation, Deep learning

TL;DR: We introduce a framework for missing data imputation that leverages causal structure and missing data indicators.

Abstract: Missing data is an important problem in machine learning practice. Starting from the premise that imputation methods should preserve the causal structure of the data, we develop a regularization scheme that encourages any baseline imputation method to be causally consistent with the underlying data generating mechanism. Our proposal is a causally-aware imputation algorithm (MIRACLE). MIRACLE iteratively refines the imputation of a baseline by simultaneously modeling the missingness generating mechanism, encouraging imputation to be consistent with the causal structure of the data. We conduct extensive experiments on synthetic and a variety of publicly available datasets to show that MIRACLE is able to consistently improve imputation over a variety of benchmark methods across all three missingness scenarios: at random, completely at random, and not at random.

Code Of Conduct: I certify that all co-authors of this work have read and commit to adhering to the NeurIPS Statement on Ethics, Fairness, Inclusivity, and Code of Conduct.

Supplementary Material: pdf

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/miracle-causally-aware-imputation-via/code)

16 Replies

Loading