\spaceendsection
\section{Distributed Discussion}
\label{sec:discussion}
\spacebefsection
% \nocite{lauritzen2017exchangeability}
% To conclude, we describe some promising directions for future work. 
% The inspiration for this work came from \cite{lauritzen2017exchangeability}, 
% which classify the 6 causal structures that are invariant to node exchangeability.  
% The inspiration for this work came from \cite{lauritzen2017exchangeability}, 
% which classify the 6 causal structures that are invariant to node exchangeability.  
% Initially, we set out to classify similar causal models for networks that grow one node at a time.  
% This requirement turned out to be overly restrictive, 
% and we were 
% pleasantly surprised 
% to find causal structures that were less rigid in the order of their generation. 
\potentiallyImprove{Initially, we set out to classify similar causal models for networks that grow one node at a time.  
This requirement turned out to be overly restrictive, 
and we were 
pleasantly surprised 
to find causal DAGs 
that were less rigid in the order of their generation.} 
% In addition to the \ppaName model, 
% We describe two more here, then conclude with a discussion about their composability.
% In particular, the {\ppaAcron} model is 
% We describe two more here, and conclude with a discussion about composing different causal structures over time.

\potentiallyImprove{In particular, causal meta-DAGs in which the dyads only depend on one of the two ``quadrants'' of past dyads---such as the ``{\ppaAcron} w/ clustering'' or ``bottom-up causality'' in Fig.~\ref{fig:PosetMetaDAG}---can be evaluated in a distributed manner (as illustrated in Fig.~\ref{fig:FirstNatureFigure}).}  
% In \Cref{fig:PosetMetaDAG}, subsets of causal arrows that appear below either``{\ppaAcron} w/ clustering'' or ``bottom-up causality'' with respect to the poset in )
% can be evaluated in a distributed manner (as illustrated in Fig.~\ref{fig:FirstNatureFigure}).  
For these models, \potentiallyImprove{coarse-graining the rows and columns of the grid of dyad variables 
results in blocks of dyads with a similar causal structure.}  
\potentiallyImprove{Thus, one can assign ``workers'' to different blocks of dyads to evaluate them in parallel 
requiring communication only when workers move to the next block.}  
\potentiallyImprove{For example, in the model below, 
$w$ workers can alternate between evaluating blocks of size \mbox{$\frac{n}{w}$-by-$\frac{n}{w}$} and a total of $2w$ rounds of communication.}
%
% \spaceendsubsection
% \section{Distributed Discussion}
% \label{sec:discussionmodelexamples}
% \spacebefsubsection

\paragraph{{\ppaAcron} model with clustering  --- $\HubName+\PathName+\OldName$.} 
\novo{As an extension of the {\ppaAcron} model, 
one could include $\OldName$ causal arrows 
(see bottom-right of Fig.~\ref{Fig:ExampleGraphicalModelLocalDirectSeveral} for its  meta-DAG.).  
This addition would, for example, 
allow for the in-degrees to also exhibit a power-law (as dyads of an income node would no longer be independent).}  
Moreover, when both $\OldName$ and $\PathName$ \potentiallyImprove{causal arrows are present, 
it is possible to promote clustering via triadic closure, 
as the similarity between the connections that nodes $i$ and $j$ make with the ``distant'' nodes \mbox{$\BoxNodeS{}<i<j$} can influence the likelihood that $i$ and $j$ themselves form a connection.}  

\paragraph{A ``bottom-up'' causality  --- $\HubName+\NewName$.} 
The {\ppaAcron} model and its extension including $\OldName$ have a ``top-down'' 
% sort of 
causal structure, 
with dyads containing older nodes influencing dyads containing newer nodes. 
Conversely, the causal model with $\HubName$ and $\NewName$ depends 
%\potentiallyImprove{on the other ``quadrant'' of dyads, 
% and instead has a sort of ``bottom-up'' sort of causal structure} 
\potentiallyImprove{on the other ``quadrant'' of dyads, 
and instead has a ``bottom-up'' causal structure} 
% \potentiallyImprove{(see its causal meta-DAG at the top-right of Fig.~\ref{Fig:ExampleGraphicalModelLocalDirectSeveral}).}  
\potentiallyImprove{(see top-right of Fig.~\ref{Fig:ExampleGraphicalModelLocalDirectSeveral} for its meta-DAG).}  
That is, the dyads containing nodes that are \textit{closer together} in the ordering 
influence the outcomes of dyads containing nodes that are \textit{further apart}.  
This causal meta-DAG could be useful for modeling ``local'' clustering  
between nodes that occur at similar times. \commentToDo{maybe put example for reasons for two different types of clustering.}  
% While this combination can also be sampled in a distributed manner,
% it is incompatible with the causal models mentioned above.  

% \paragraph{Composing Causalities.}
% \potentiallyImprove{This incompatibility can be resolved by composing multiple causal DAGs together, 
% performing the acyclic computations in one, and using the outcomes to seed the other.  
% Indeed, many apparent cycles in causal models can be ``fine-grained'' away, 
% as they actually represent an underlying series of back-and-forth communication.  
% Decomposing the behavior of complex interconnected networks into a collection causal DAGs could prove a fruitful direction for future study.}  

\spaceendsubsection
\subsection{Sparse composable structural equations}
\label{sec:DorPAModel}
\spacebefsubsection
% \cite{griffiths2011indian} ghahramani2005infinite
 % Aldous, D. J. (1985). "Exchangeability and related topics". École d'Été de Probabilités de Saint-Flour XIII — 1983. Lecture Notes in Mathematics. Vol. 1117. pp. 1–198. doi:10.1007/BFb0099421. ISBN 978-3-540-15203-3. The restaurant process is described on page 92.
% \citep{aldous2006ecole}
% \citep{pitman1995exchangeable}
% teh2010hierarchical
% DAPA is a fine model indeed, 
% but here is another ORption:
As previously mentioned (\Cref{sec:localcausalarrows}), the {$\OldName$} and {$\NewName$} causal arrows cannot be included in the same causal meta-DAG.  
Here we propose a way to do essentially that.  

% The choice of an affine structural equation in the {\ppaAcron} model was motivated in part by its similarity to other classical growing models, such as Pólya's urn \citep{eggenberger1923statistik,mahmoud2008polya}, the process of Pitman-Yor \citep{pitman1997two}, and various Canonical Restaurant processes (such as the well-known CRP \citep{aldous2006ecole} and IBP \citep{ghahramani2005infinite} (aka, the Infinite Buffet process).  
The choice of an affine structural equation in the {\ppaAcron} model was motivated in part by its similarity to other classical growing models, such as Pólya's urn \citep{eggenberger1923statistik,mahmoud2008polya}, the process of Pitman-Yor \citep{pitman1997two}, and various Canonical Restaurant  processes \citep{aldous2006ecole,ghahramani2005infinite}.  

Here is another option for \Cref{eq:PPAprob}, with \potentiallyImprove{ellipsis (...) to suggest its straightforward generalization:} 
\begin{align}
    p_{ij}^{ } &= 1 - \exp\!\bigg( - \frac{\alphaP + \thetain d_i^{\text{in}} + \thetaout d_i^{\text{out}} + \cdots}{j+\betaP}\bigg) \label{eq:DORPAprob}
\end{align}
This simple transformation of the affine model retains all the asymptotics of the original {\ppaAcron} model  
(since \mbox{\smash{$1-\exp(-p_{ij})\approx p_{ij}$}} for \mbox{$p_{ij}\ll1$}).  
\potentiallyImprove{But now the inclusion of additional terms such as \mbox{\smash{$\thetaold d^{\text{old}}_j$}} predictably increase the probability of an edge 
(without becoming greater than $1$).  
Also, different choices for the denominators might allow for better modeling of growing networks over a wide range of scales.}  

\potentiallyImprove{However, the most compelling property of a model using \Cref{eq:DORPAprob} is that it allows one to effectively sample from causal models that apparently have cycles!}   
Moreover, the algorithm is naturally asynchronous, and exploits the sparsity of the resulting network.
\commentToDo{explain these things in more detail.} 

% \potentiallyImprove{Here is a sketch of the algorithm.  
% Initialize all dyads in the graph as ``empty''.  
% Add an edge independently for each dyad variable with probability \mbox{$\exp(-\frac{\alphaP}{j+\betaP})$}, and set those dyads as ``active''.  
% Iteratively take an active dyad, set it as ``completed'', 
% and sample its {$\HubName$} children independently with probability \mbox{$\exp(-\frac{\thetain}{j+\betaP})$}, 
% and likewise for {$\PathName$} children.  
% For any sampled children dyads that are currently ``empty'', 
% add that edge to the graph and set those dyads as ``active''.  
% Continue until there are no ``active'' dyads.}  
\potentiallyImprove{Here is a sketch of the algorithm for such a model.   
Initialize all dyads in the network as ``empty''.  
Add an edge independently for each dyad variable with probability \mbox{$\exp(-\frac{\alphaP}{j+\betaP})$}, and set those dyads as ``active''.  
Iteratively take an active dyad, set it as ``completed'', 
and sample its {$\HubName$} children independently with probability \mbox{$\exp(-\frac{\thetain}{j+\betaP})$}, 
and likewise for {$\PathName$} children.  
For any sampled children dyads that are currently ``empty'', 
add that edge to the network and set those dyads as ``active''.  
Continue until there are no ``active'' dyads.} 
%

% \potentiallyImprove{This perspective allows for directed cycles between the dyads, 
% essentially ``fine-graining'' them into a series of back-and-forth communication, 
% while the acyclic transitions \mbox{($\text{``empty''}\rightarrow\text{``active''}\rightarrow\text{``completed''}$)} ensure that the algorithm will terminate for a finite graph.  
% Modeling the behavior of complex interconnected networks in this modular ``event-based'' manner is a promising direction for future study.}  
\potentiallyImprove{This perspective effectively allows for directed cycles between the dyads, 
essentially ``fine-graining'' them into a series of back-and-forth communication, 
while the acyclic transitions \mbox{($\text{``empty''}\rightarrow\text{``active''}\rightarrow\text{``completed''}$)} ensure that the algorithm will terminate for a finite network.  
Modeling the behavior of complex interconnected networks in this modular ``event-based'' manner is a promising direction for future study.}


% \paragraph{Fin.}\\
% % \noindent
% \hfill$\blacksquare$
% In summary, 
% In a nutshell, 
% our key insight in this work was to study invariance of 
% % the 
% causal mechanisms instead of node exchangeability, 
% connecting 
% the framework of 
% structural causal models 
% (SCM) 
% with 
% statistically attractive 
% generative models for growing networks. 

% \spaceendsection

% \subsection{Promising Sequels} % possible sequels promising sequels  Future Directions
% \spacebefsubsection

% \commentG{this might go on the applications and extensions section}
% ***This work opens up many interesting directions... 

% \paragraph{General message.}
% The initial inspiration for this work 
% % come from a 
% was a 
% % neat 
% paper by \citet{lauritzen2017exchangeability} connecting maximum entropy distributions over (undirected simple) graphs with $n$ nodes 
% % (\ie, ERGMs), 
% (a.k.a., ERGMs), 
% % (\ie, exponential random graph models, or ERGMs) 
% and graphical models describing the conditional independence structure of 
% % their 
% the ERGMs's 
% ${n \choose 2}$ dyads 
% % random 
% variables.  
% % (Note that ERGMs are exchangeable over the nodes, but their random variables are the ${n \choose 2}$ dyads.) 
% % So, they ask about what are the allowed conditional independent statements between the dyads, under the constraint that the distribution is exchangeable over nodes. 
% They showed that there are only 6 different classes of graphical models and described the sufficient statistics of the associated ERGMs. 
% (Note that ERGMs are exchangeable over the nodes, 
% but their random variables are the 
% % ${n \choose 2}$ 
% dyads.) 



% the conditional independence between the ${n \choose 2}$ dyads variables 
% statements that are implied by graphical models.  
% in which
% In that paper in which the authors' characterize the types of independence structure between the ${n \choose 2}$ dyads variables of an Exponential Random Graph Models (ERGMs) for simple graphs with $n$ nodes. 
%
% which described the conditional independence structure of the dyads of exchangeable networks models over a finite number of nodes, that is, ERGMs. 
% Exponential random network graph models (ERGMs) over simple graphs with $n$ nodes. 
% We study conditional independence relationships for random networks and their interplay with exchangeability. We show that, for finitely exchangeable network models, the empirical subgraph densities are maximum likelihood estimates of their theoretical counterparts. We then characterize all possible Markov structures for finitely exchangeable random graphs, thereby identifying a new class of Markov network models corresponding to bidirected Kneser graphs. In particular, we demonstrate that the fundamental property of dissociatedness corresponds to a Markov property for exchangeable networks described by bidirected line graphs. Finally we study those exchangeable models that are also summarized in the sense that the probability of a network depends only on the degree distribution, and we identify a class of models that is dual to the Markov graphs of Frank and Strauss. Particular emphasis is placed on studying consistency properties of network models under the process of forming subnetworks and we show that the only consistent systems of Markov properties correspond to the empty graph, the bidirected line graph of the complete graph and the complete graph. 
%%
% A key insight of this work was to study invariance of 
% % the 
% causal mechanisms instead of node exchangeability, 
% connecting 
% the framework of 
% structural causal models (SCM) with 
% statistically attractive 
% generative models for growing networks. 
%



% \potentiallyImprove{
% A key insight of this work 
% is 
% to impose invariance of 
% the 
% causal mechanisms 
% % for generating the network 
% instead of node exchangeability, 
% thereby 
% connecting 
% the framework of 
% structural causal models (SCM) with 
% statistically attractive 
% generative models for growing networks.
% }



% Node-exchangeable models of networks (such as graphons) frequently have difficulty describing real-world networks. 
% For example, they are unable to describe sparse networks, essentially treating them all as equivalent to the network without edges. 
% While various modifications have been suggested to cope with this issue, many of the hallmarks of real-world networks do not sit comfortably in this framework. 


% However, real-world networks do not (typically) pop into existence fully-developed. 
% Much like the assumption of a ``Last Universal Common Ancestor'' in evolutionary biology \citep{darwin1964origin}, or the ``Past Hypothesis'' in cosmology \citep{hertog2024origin}, 
% % \cite{boltzmann1910vorlesungen}
% to better understand the state of a system at any given point in time, it is often insightful to model the history leading up to that point.  
% This temporal evolution naturally introduces a notion of causality. 
% \newpage

% \begin{comment}
% Another (more loose and flavourful) source of inspiration for this work comes from various research programs 
% % works 
% on the foundation of physics. 
% A particular approach to quantize gravity called \textit{causal sets} shares our love for causality and partial orders. 
% % \citep{bombelli1987space} 
% % has 
% % is in spirit quite aligned to our work. 
% % We propose that space-time at the smallest scales is in reality a causal set: a locally finite set of elements endowed with a partial order corresponding to the macroscopic relation that defines past and future. We explore how a Lorentzian manifold can approximate a causal set, noting in particular that the thereby defined effective dimensionality of a given causal set can vary with length scale. Finally, we speculate briefly on the quantum dynamics of causal sets, indicating why an appropriate choice of action can reproduce general relativity in the classical limit.
% The theory postulates that spacetime is atomic at the Planck scale and takes the form of a locally finite partial order, or a ``causal set'', corresponding to the macroscopic relation that defines past and future \citep{bombelli1987space, dowker2013introduction,dowker2023intrinsic}. 
% \end{comment}


% \begin{comment}
% \textit{Polya citations:}\\
% --- \citep{kaijser2017note}  A note on the rechargeable Polya urn scheme\\
% --- \citep{marcaccioli2019polya} A Pólya urn approach to information filtering in complex networks\\
% --- \citep{pekoz2019polya} Pólya urns with immigration at random times\\
% ``We study the number of white balls in a classical Pólya urn model with the additional feature that, at random times, a black ball is added to the urn. The number of draws between these random times are i.i.d. and, under certain moment conditions on the inter-arrival distribution, we characterize the limiting distribution of the (properly scaled) number of white balls as the number of draws goes to infinity. The possible limiting distributions obtained in this way vary considerably depending on the inter-arrival distribution and are difficult to describe explicitly. However, we show that the limits are fixed points of certain probabilistic distributional transformations, and this fact provides a proof of convergence and leads to properties of the limits. The model can alternatively be viewed as a preferential attachment random graph model where added vertices initially have a random number of edges, and from this perspective, our results describe the limit of the degree of a fixed vertex.''\\
% --- \citep{bassetti2009statistical} Statistical Mechanics of the Chinese Restaurant Process:
% lack of self-averaging, anomalous finite-size effects and condensation\\

% \textit{PPPA-like or pppa-related:}\\
% % --- \citep{krapivsky2000connectivity} Connectivity of Growing Random Networks\\
% % prop to degree to the k, k=1 critical for condensation\\
% --- \citep{artico2020rare} with other stats they show power-laws are indeed common \\
% --- \citep{crane2023root}  ( this one is about recoverying the first node so not sure) \\
% --- \citep{papadopoulos2012popularity}\\
% --- \citep{lee2015preferential} \\
% % \nocite{papadopoulos2012popularity,lee2015preferential,crane2023root}
% --- \citep{du2025proof} (the recent paper from arxiv) A Proof of The Changepoint Detection Threshold Conjecture
% in Preferential Attachment Models \\
% --- \citep{eikmeier2019triangle} TRIANGLE PREFERENTIAL ATTACHMENT HAS POWER-LAW DEGREES AND EIGENVALUES; EIGENVALUES ARE MORE STABLE TO
% NETWORK SAMPLING


% \textit{Sparse network models:}\\
% --- \citep{lunde2023subsampling} subsampling sparse graphons under minimal assumptions\\
% --- \citep{bianconi2022grand} Grand Canonical Ensembles of Sparse Networks and Bayesian Inference\\
% --- \citep{bianconi2022statistical} Statistical physics of exchangeable sparse simple networks, multiplex networks and simplicial complexes\\
% ``Exchangeability is a desired statistical property of network ensembles requiring their invariance upon relabelling of the nodes. However combining sparsity of network ensembles with exchangeability is challenging. Here we propose a statistical physics framework and a Metropolis-Hastings algorithm defining exchangeable sparse network ensembles. The model generates networks with heterogeneous degree distributions by enforcing only global constraints while existing (non exchangeable) exponential random graphs enforce an extensive number of local constraints. This very general theoretical framework to describe exchangeable networks is here first formulated for uncorrelated simple networks and then it is extended to treat simple networks with degree correlations, directed networks, bipartite networks and generalized network structures including multiplex net- works and simplicial complexes. In particular here we formulate and treat both uncorrelated and correlated exchangeable ensembles of simplicial complexes using statistical mechanics approaches.''\\


% \textit{Conditional indep issues:}\\
% --- \citep{montague2018graphical} GRAPHICAL MARKOV MODELS FOR INFINITELY MANY VARIABLES
% \\
% \begin{comment}
% ``Representing the conditional independences present in a multivariate random vector via graphs has found widespread use in applications, and such representations are popularly known as graphical models or Markov random fields. These models have many useful properties, but their fundamental attractive feature is their ability to reflect conditional independences between blocks of variables through graph separation, a consequence of the equivalence of the pairwise, local, and global Markov properties demonstrated by Pearl and Paz (1985). Modern-day applications often necessitate working with either an infinite collection of variables (such as in a spatial-temporal field) or approximating a large high-dimensional finite stochastic system with an infinite-dimensional system. However, it is unclear whether the conditional independences present in an infinite-dimensional random vector or stochastic process can still be represented by separation criteria in an infinite graph. In light of the advantages of using graphs as tools to represent stochastic relationships, we undertake in this paper a general study of infinite graphical models. First, we demonstrate that na ̈ıve extensions of the assumptions required for the finite case results do not yield equivalence of the Markov properties in the infinite-dimensional setting, thus calling for a more in-depth analysis. To this end, we proceed to derive general conditions which do allow representing the conditional independence in an infinite-dimensional random system by means of graphs, and our results render the result of Pearl and Paz as a special case of a more general phenomenon. We conclude by demonstrating the applicability of our theory through concrete examples of infinite-dimensional graphical models.''\\
% \end{comment}
% --- \citep{engelke2022graphical} (less is sebastian work, levy processes, maximal value theory, infinite measures, etc)\\


% \textit{Temporal networks and causal:}\\
% --- \citep{misiakos2025learning} LEARNING DAGS AND ROOT CAUSES FROM TIME-SERIES DATA
% \\
% \begin{comment}
%     We introduce DAG-TFRC, a novel method for learning directed acyclic graphs (DAGs) from time series with few root causes. By this, we mean that the data are generated by a small number of events at certain, unknown nodes and time points under a structural vector autoregression model. For such data, we (i) learn the DAGs representing both the instantaneous and time-lagged dependencies between nodes, and (ii) discover the location and time of the root causes. For synthetic data with few root causes, DAG-TFRC shows superior performance in accuracy and runtime over prior work, scaling up to thousands of nodes. Experiments on simulated and real-world financial data demonstrate the viability of our sparse root cause assumption. On S&P 500 data, DAG-TFRC successfully clusters stocks by sectors and discovers major stock movements as root causes.
% \end{comment}


% \textit{Different network models and higher-order ones:}\\
% --- \citep{bianconi2017emergent} Emergent Hyperbolic Network Geometry\\
% ``A large variety of interacting complex systems are characterized by interactions occurring between more than two nodes. These systems are described by simpli- cial complexes. Simplicial complexes are formed by simplices (nodes, links, triangles, tetrahedra etc.) that have a natural geometric interpretation. As such simplicial complexes are widely used in quantum gravity approaches that involve a discretization of spacetime. Here, by extending our knowledge of growing complex networks to growing simplicial complexes we investigate the nature of the emergent geometry of complex networks and explore whether this geometry is hyperbolic. Specifically we show that an hyperbolic network geometry emerges spontaneously from models of growing simplicial complexes that are purely combinatorial. The statistical and geometrical properties of the growing simplicial complexes strongly depend on their dimensionality and display the major universal properties of real complex networks (scale-free degree distribution, small-world and communities) at the same time. Interestingly, when the network dynamics includes an heterogeneous fitness of the faces, the growing simplicial complex can undergo phase transitions that are reflected by relevant changes in the network geometry.''\\
% --- \citep{bianconi2016network} Network geometry with flavor: from complexity to quantum geometry\\

% \textit{Causal inference in networks:}\\
% --- \citep{ogburn2024causal} causal inference for social network data\\
% --- \citep{ogburn2020causal} Causal inference, social networks and chain graphs\\
% both above are more focus on individual treatment effect and the network model is different, it is graphony\\
% ---

% \textit{Other exchangeable models for networks:}\\
% --- \citep{crane2016edge} EDGE EXCHANGEABLE MODELS FOR NETWORK DATA\\
% --- \citep{wiqvist2019partially} Partially Exchangeable Networks and Architectures for Learning Summary Statistics in Approximate Bayesian Computatio\\
% --- \citep{veitch2015class} THE CLASS OF RANDOM GRAPHS ARISING FROM EXCHANGEABLE RANDOM MEASURES (peter)\\

% \textit{Related to network size and subsampling:}\\
% --- \citep{smith2016empirical} Empirical Reference Distributions for Networks of Different Size\\
% % Network analysis has become an increasingly prevalent research tool across a vast range of scientific fields. Here, we focus on the particular issue of comparing network statistics, i.e. graph-level measures of network structural features, across multiple networks that differ in size. Although “normalized” versions of some network statistics exist, we demonstrate via simulation why direct comparison is often inappropriate. We consider normalizing network statistics relative to a simple fully parameterized reference distribution and demonstrate via simulation how this is an improvement over direct comparison, but still sometimes problematic. We propose a new adjustment method based on a reference distribution constructed as a mixture model of random graphs which reflect the dependence structure exhibited in the observed networks. We show that using simple Bernoulli models as mixture components in this reference distribution can provide adjusted network statistics that are relatively comparable across different network sizes but still describe interesting features of networks, and that this can be accomplished at relatively low computational expense. Finally, we apply this methodology to a collection of ecological networks derived from the Los Angeles Family and Neighborhood Survey activity location data.
% --- \citep{grannis2004sampling} SAMPLING THE STRUCTURE OF LARGE-SCALE SOCIAL NETWORKS\\


% \textit{Poset focus:}\\
% -- \citep{taeb2024model} model selection over partially ordered sets? \\
% -- \citep{snellman2025polytope} polytope business; The Polytope of Probability Functions
% on a Finite Poset\\
% --- \citep{rhee1990matrix} A MATRIX REPRESENTATION OF POSETS AND ITS APPLICATIONS\\
% remove, i don't get.\\
% --- \citep{mwafise2024machine} Machine Learning and Data Analysis Using Posets: A Survey\\


% \textit{Other ideas of symmetry for causality that are SUPER RELEVANT/RELATED:}\\

% --- \citep{blom2020conditional} Conditional independences and causal relations implied by sets of equations **** (very relevant title)\\
% ``Real-world complex systems are often modelled by sets of equations with endogenous and exogenous variables. What can we say about the causal and probabilistic aspects of vari- ables that appear in these equations without explicitly solving the equations? We make use of Simon’s causal ordering algorithm (Simon, 1953) to construct a causal ordering graph and prove that it expresses the effects of soft and perfect interventions on the equations un- der certain unique solvability assumptions. We further construct a Markov ordering graph and prove that it encodes conditional independences in the distribution implied by the equations with independent random exogenous variables, under a similar unique solvability assumption. We discuss how this approach reveals and addresses some of the limitations of existing causal modelling frameworks, such as causal Bayesian networks and structural causal models.''\\

% --- \citep{brightwell2011order} ORDER-INVARIANT MEASURES ON CAUSAL SETS\\
% ``A causal set is a partially ordered set on a countably infinite ground-set such that each element is above finitely many others. A natural extension of a causal set is an enumeration of its elements which respects the order.\\
% We bring together two different classes of random processes. In one class, we are given a fixed causal set, and we consider random natural extensions of this causal set: we think of the random enumeration as being generated one point at a time. In the other class of processes, we generate a random causal set, working from the bottom up, adding one new maximal element at each stage.\\
% Processes of both types can exhibit a property called order-invariance: if we stop the process after some fixed number of steps, then, conditioned on the structure of the causal set, every possible order of generation of its elements is equally likely.
% We develop a framework for the study of order-invariance which includes both types of example: order-invariance is then a property of probability measures on a certain space. Our main result is a description of the extremal order-invariant measures.''


% \textit{Comparing causal models:}\\
% --- \citep{otsuka2022equivalence} On the Equivalence of Causal Models: A Category-Theoretic Approach \\
% --- \citep{lorenz2023causal} Causal models in string diagrams\\

% \textit{Inference in SEMs:}\\
% --- \citep{misiakos2024learning} LEARNING DAGS FROM DATA WITH FEW ROOT CAUSES\\
% (they are the ones with fourrier transform over poset and such they seem to do cool work)\\
% --- \\

% \textit{Causal structure learning:}\\
% --- \citep{squires2023causal} Causal Structure Learning: a Combinatorial Perspective\\


% \textit{Physics foundations:}\\
% % On the foundations of physics... 
% % \paragraph{Causal sets.}
% %
% Another (more loose and flavourful) source of inspiration for this work comes from various research programs 
% % works 
% on the foundation of physics. 
% We now quickly 
% % mention 
% highlight 
% a few of them 
% so as to not digress too much. 
% --- causal sets \\
% Classical sequential growth dynamics for causal sets\\
% \textit{causal sets} 
% A particular approach to quantize gravity called \textit{causal sets} shares our love for causality and partial orders. 
% % \citep{bombelli1987space} 
% % has 
% % is in spirit quite aligned to our work. 
% % We propose that space-time at the smallest scales is in reality a causal set: a locally finite set of elements endowed with a partial order corresponding to the macroscopic relation that defines past and future. We explore how a Lorentzian manifold can approximate a causal set, noting in particular that the thereby defined effective dimensionality of a given causal set can vary with length scale. Finally, we speculate briefly on the quantum dynamics of causal sets, indicating why an appropriate choice of action can reproduce general relativity in the classical limit.
% The theory postulates that spacetime is atomic at the Planck scale and takes the form of a locally finite partial order, or a ``causal set'', corresponding to the macroscopic relation that defines past and future \citep{bombelli1987space, dowker2013introduction,dowker2023intrinsic}. \\
% % \nocite{bombelli1987space,dowker2013introduction,dowker2023intrinsic,wallden2013causal,dowker2020symmetry,dowker2020manifestly} 
% % \cite{bombelli1987space,dowker2013introduction,dowker2023intrinsic,wallden2013causal,dowker2020symmetry,dowker2020manifestly} 
% % has in spirit 
% % In particular, causal sets, a particular program to discretize 
% % various attempts to discretize gravity 
% % The causal sets program is an approach to quantum gravity. Its founding principles are that spacetime is fundamentally discrete (a collection of discrete spacetime points, called the elements of the causal set) and that spacetime events are related by a partial order. This partial order has the physical meaning of the causality relations between spacetime events.
% % *** ...the quantum people? \\
% -- the trilogy of causality \citep{gogioso2022combinatorics,gogioso2023topology,gogioso2023geometry} \\
% -- Aleks 
% \citep{kissinger2019categorical,simmons2024complete, simmons2022higher,jacobs2021causal,jacobs2019causal}\\
% \citep{kissinger2017equivalence,hefford2022pre} \\
% -- Nick Ormrod from the event causality \citep{ormrod2024quantum}\\
% -- relational quantum mechanics rovelli?? \citep{Rovelli2021-ROVHMS} \\
% suffice to say the emphasis on the relational nature of reality....
% % the trilogy of causality \citep{gogioso2022combinatorics,gogioso2023topology,gogioso2023geometry} 
% \begin{comment}
% Causal modeling with infinitely many variables. \\
% \citep{peters2021causal, halpern2022reasoning}  
% \end{comment}
% \end{comment}

% \end{acknowledgements}

% \newpage
% References
% \bibliography{uai2025-template}

% these aren't sorted yet, but are references I likely want to mention and/or read closer