MACAW: A Causal Generative Model for Medical Imaging

TMLR Paper5831 Authors

06 Sept 2025 (modified: 02 Feb 2026) | Decision pending for TMLR | Everyone | CC BY 4.0

Abstract: Although deep learning techniques show promising results for many neuroimaging tasks in research settings, they have not yet found widespread use in clinical scenarios. One reason is that many machine learning models only identify correlations between the input images and the outputs of interest, which can lead to practical problems such as the encoding of uninformative biases and reduced explainability. Recent research is therefore exploring whether integrating *a priori* causal knowledge into deep learning models is a potential avenue for identifying these problems. However, encoding causal reasoning and generating genuine counterfactuals necessitates computationally expensive invertible processes, which restricts analyses to a small number of causal variables and renders them infeasible for generating even 2D images. To overcome these limitations, this work introduces a new causal generative architecture named Masked Causal Flow (MACAW) for neuroimaging applications. Within this context, three main contributions are described. First, a novel approach that integrates complex causal structures into normalizing flows is proposed. Second, counterfactual prediction is performed to identify the changes in effect variables associated with a cause variable. Finally, an explicit Bayesian inference for classification is derived and implemented, providing an inherent uncertainty estimation. The feasibility of the proposed method was first evaluated using synthetic data and then using MRI brain data from more than 23,000 participants in the UK Biobank study.
The evaluation results show that the proposed method can (1) accurately encode causal reasoning and generate counterfactuals highlighting the structural changes in the brain known to be associated with aging, (2) accurately predict a subject's age from a single 2D MRI slice, and (3) generate new samples assuming other values for subject-specific indicators such as age, sex, and body mass index.

Submission Length: Long submission (more than 12 pages of main content)

Changes Since Last Submission:

1. The notation for the *do* operator has been updated throughout the manuscript from $do(x=\alpha)$ to $do(x=\alpha \mid x=\mathbf{x}^{obs})$.
2. Additional appendices have been included to present supplementary results and experiments:
   * Additional results for the 1-D experiments
   * MorphoMNIST experiments
3. The following sections have been revised:
   * **2.3 Related Work:** clarification on the theoretical soundness of the proposed approach
   * **4.1.3 Generative Sampling:** additional 1-D experiment results using MMD
   * **4.3 Comparison with H-VAE:** new results demonstrating MACAW’s effectiveness and runtime efficiency
   * **5 Discussion:**
     * clarification on invertibility and theoretical soundness
     * shortcomings of the H-VAE comparisons
     * dimensionality-reduction approaches and issues related to latent space

Assigned Action Editor: ~Krikamol_Muandet1

Submission Number: 5831
