Causal Structure of the Planet Earth: Learning System Dynamics with Hypergraphs

Published: 22 Sept 2025, Last Modified: 22 Sept 2025WiML @ NeurIPS 2025EveryoneRevisionsBibTeXCC BY 4.0
Keywords: causal inference, hypergraphs, transportability, foundation models
Abstract: \documentclass[11pt]{article} % Margins \usepackage[margin=0.75in]{geometry} % Figures and graphics \usepackage{graphicx} \usepackage{amsmath, amssymb} % for math \usepackage{url} % Optional: shrink references to fit within one page \usepackage{natbib} \bibliographystyle{plainnat} % \setlength{\bibsep}{0pt plus 0.3ex} % tighter references \usepackage[colorlinks=true, linkcolor=blue, citecolor=blue, urlcolor=blue]{hyperref} % Make sure it's anonymous: no author info \title{Causal Structure of Earth: Learning System Dynamics with Hypergraphs} \author{} \date{} \begin{document} \maketitle \begin{abstract} Abrupt shifts in the Earth system such as Amazon forest dieback, Arctic ice loss, or monsoon collapse arise from complex teleconnections \citep{beamish1995climate}, where variability in one region drives ecological responses elsewhere. Standard causal discovery methods, including Granger causality and PCMCI+, represent dependencies with pairwise graphs . While effective for low-dimensional signals, they cannot capture the higher-order interactions that often trigger regime shifts, for example when both tropical Atlantic warming and Pacific circulation anomalies jointly cause Amazon drought \citep{marengo2008hydro}. We introduce a framework that models causal structure as a hypergraph \citep{ma2022learning} $H=(V,\mathcal{E})$, where $\mathcal{E}\subseteq 2^V\setminus\{\varnothing\}$ encodes group-level edges linking subsets of variables to outcomes. For any subset $S\subseteq V$, we define the set-wise causal effect \[ \tau(S\!\to\!Y;x_S,x'_S)=\mathbb{E}[Y\mid \operatorname{do}(X_S=x_S)]-\mathbb{E}[Y\mid \operatorname{do}(X_S=x'_S)], \] which generalizes pairwise interventions to multi-variable causes. This representation allows us to test whether sets of factors rather than individual variables drive critical transitions. Our method extends constraint-based and score-based discovery by lifting conditional independence tests and sparsity penalties from pairwise to set-wise relations. Candidate hyper-edges are screened via conditional mutual information $I(Y;X_S\mid Z)$ over spatio-temporal contexts, and group-sparse penalties control combinatorial growth. We implement this directly on multi-scale latent features from a recent Earth-system foundation models, enabling discovery under non-stationarity and across spatial scales. To address data-poor regions such as the Arctic, polar oceans, or tropical rainforests, we integrate hypergraph discovery with causal transportability \citep{pearl2010causal}. By constructing selection diagrams that encode cross-domain differences, we identify conditions under which causal relations transfer from data-rich basins (e.g., the Pacific Ocean) to under-observed ecosystems. This extends counterfactual reasoning \citep{pearl2010causal} to regions where direct observations are limited, enabling forecasts of coral reef collapse, Arctic ice decline, or rainforest tipping points. Preliminary results show that hypergraph discovery (i) recovers known teleconnections such as ENSO--East Africa rainfall and Arctic --midlatitude winter links , (ii) reveals higher-order interactions (triplets and quartets) that outperform pairwise models in explaining regime shifts, and (iii) enables counterfactual experiments that assess scenarios such as reduced Arctic warming trajectories. By moving beyond pairwise causality, this work demonstrates that hypergraph-based causal discovery in foundation models yields more interpretable and actionable forecasts of cascading climate risks, offering a new pathway to earlier warnings and more targeted adaptation strategies. \end{abstract} % References should be concise \bibliographystyle{plainnat} \bibliography{refs} \end{document}
Submission Number: 428
Loading