Position: Probabilistic Modelling is Sufficient for Causal Inference

Bruno Kacper Mlodozeniec; David Krueger; Richard E. Turner

Position: Probabilistic Modelling is Sufficient for Causal Inference

Bruno Kacper Mlodozeniec, David Krueger, Richard E. Turner

Published: 01 May 2025, Last Modified: 15 Aug 2025ICML 2025 Position Paper Track oralEveryoneRevisionsBibTeXCC BY 4.0

TL;DR: We illustrate a probabilistic modelling approach to causal inference questions, challenging the prevalent claims that you need a bespoke causal framework or notation for causal inference.

Abstract: Causal inference is a key research area in machine learning, yet confusion reigns over the tools needed to tackle it. There are prevalent claims in the machine learning literature that you need a bespoke causal framework or notation to answer causal questions. In this paper, we make it clear that you can answer any causal inference question within the realm of probabilistic modelling and inference, without causal-specific tools or notation. Through concrete examples, we demonstrate how causal questions can be tackled by writing down the probability of everything. We argue for the advantages of the generality of the probabilistic modelling lens, when compared to bespoke causal frameworks. Lastly, we reinterpret causal tools as emerging from standard probabilistic modelling and inference, elucidating their necessity and utility.

Lay Summary: Science and machine learning often deal with questions of cause and effect. For example, does a drug actually cure a disease, or do healthier people just tend to take it? A common belief is that to answer these “causal” questions, you need a specialised mathematical language, separate from standard statistics. This paper illustrates that you don’t. We show that the familiar tools of probability are all you need. The key is to use a single statistical model for both the world we observe but also the hypothetical worlds we want to ask questions about. By explicitly specifying how they relate to one another, we can make inferences about events in the hypothetical worlds using data from the observed world only. Imagine that you have data showing that people who take more aspirin often get worse headaches. To find out if aspirin is actually helpful, or if the correlation between positive effects and higher aspirin doses spurious, we can add a hypothetical "intervened world" to our model—one where we decide the aspirin dose for everyone. By connecting these worlds through shared assumptions (like how the human body works), we can use real-world data to see what would happen in our hypothetical one. This approach simplifies answering causal questions, framing them as standard problems in probability. It shows that the prevalent specialised causal tools are just convenient shortcuts within this broader framework, making causal reasoning accessible to broader range of machine learning researchers.

Verify Author Names: My co-authors have confirmed that their names are spelled correctly both on OpenReview and in the camera-ready PDF. (If needed, please update ‘Preferred Name’ in OpenReview to match the PDF.)

No Additional Revisions: I understand that after the May 29 deadline, the camera-ready submission cannot be revised before the conference. I have verified with all authors that they approve of this version.

Pdf Appendices: My camera-ready PDF file contains both the main text (not exceeding the page limits) and all appendices that I wish to include. I understand that any other supplementary material (e.g., separate files previously uploaded to OpenReview) will not be visible in the PMLR proceedings.

Latest Style File: I have compiled the camera ready paper with the latest ICML2025 style files <https://media.icml.cc/Conferences/ICML2025/Styles/icml2025.zip> and the compiled PDF includes an unnumbered Impact Statement section.

Paper Verification Code: NzYyN

Permissions Form: pdf

Primary Area: Research Priorities, Methodology, and Evaluation

Keywords: probabilistic modelling, causal inference, graphical models, Bayesian inference, causality, potential outcomes, probabilistic programs

Submission Number: 152

Loading