%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%%%%%%%%%%%% IMPORTANT BEGIN 
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Real-world networks grow over time; statistical models based on node exchangeability are not appropriate.  Instead of constraining the structure of the \textit{distribution} of edges, we propose that the relevant symmetries refer to the \textit{causal structure} between them.  We first enumerate the 96 causal directed acyclic graph (DAG) models over pairs of nodes (dyad variables) in a growing network with finite ancestral sets that are invariant to node deletion. We then partition them into 21 classes with ancestral sets that are closed under node marginalization. Several of these classes are remarkably amenable to parallelization.  As an example, we highlight a simple model that exhibits flexible power-law degree distributions and emergent phase transitions in sparsity, which we characterize analytically.  With few parameters and much conditional independence, our proposed framework provides natural baseline models for causal inference in relational data.  
% \textcolor{purple}{\textit{Keywords:}\\
% causal DAGs, 
% growing networks, 
% structural causal models (SCMs),
% scale-free networks, 
% preferential attachment
% causal inference, 
% relational data
% }
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%%%%%%%%%%%% IMPORTANT END
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%
% Real-world networks grow over time; statistical models based on node exchangeability are not appropriate.  Instead of constraining the structure of the \textit{distribution} of edges, we propose that the relevant symmetries refer to the \textit{causal structure} between them.  We first enumerate the 96 causal directed acyclic graph (DAG) models over pairs of nodes (dyad variables) in a growing network with finite ancestral sets that are invariant to node deletion. We then partition them into 21 classes with ancestral sets that are closed under node marginalization. Several of these classes are remarkably amenable to parallelization.  As an example, we highlight a simple model that exhibits flexible power-law degree distributions and emergent phase transitions in sparsity, which we characterize analytically.  With few parameters and much conditional independence, our proposed framework provides natural baseline models for causal inference in relational data.  
%%
\begin{abstract}
Real-world networks grow over time; statistical models based on node exchangeability are not appropriate.  
Instead of constraining the structure of the \textit{distribution} of edges, 
we propose that the relevant symmetries refer to the \mbox{\textit{causal structure}} between them.  
We first enumerate the 96 causal directed acyclic graph (DAG) models over pairs of nodes (dyad variables) in a growing network with finite ancestral sets that are invariant to node deletion. 
We then partition them into 21 classes with ancestral sets that are closed under node marginalization. 
Several of these classes are remarkably amenable to distributed and asynchronous evaluation.  
As an example, we highlight a simple model that exhibits flexible power-law degree distributions and emergent phase transitions in sparsity, which we characterize analytically.  
With few parameters and much conditional independence, our proposed framework provides natural baseline models for causal inference in relational data. 
\end{abstract}


\begin{comment}
\begin{abstract}
Real-world networks grow over time; 
statistical models based on node exchangeability are not appropriate.  
Instead of constraining the structure of the \textit{distribution} of edges, 
we propose that the relevant symmetries refer to the \textit{causal structure} between them.  
% We propose a framework describing the classes of  causal directed acyclic graph (DAG) models over pairs of nodes (dyads) in a growing network,
% whose causal ordering remains invariant upon interventions on 
% %the nodes and edges 
% the growing network. 
We first enumerate the 96 causal directed acyclic graph (DAG) models over 
\potentiallyImprove{pairs of nodes (dyad variables) in a growing network with finite ancestral sets}
% edge variables
that are invariant to node deletion. 
% that are invariant to node deletion.  \potentiallyRemove{and have finite ancestral sets}. 
% \potentialAlternative{We first enumerate the 96 causal directed acyclic graph (DAG) models over edge variables that are invariant to node deletion.}{We first enumerate the 96 causal directed acyclic graph (DAG) models over edge variables of a growing network that have finite ancestral set and are invariant to deleting nodes of the growing network.}
% \potentialAlternative{We first enumerate the 96 causal directed acyclic graph (DAG) models over edge variables that are invariant to node deletion.}{We first enumerate the 96 causal directed acyclic graph (DAG) models over edge variables of a growing network that have finite ancestral set and are invariant to deleting nodes of the growing network.}
% We first enumerate the 96 causal directed acyclic graph (DAG) models 
% DAG structures 
% over edge variables that are invariant to node deletion.  
% over edge variables of a growing network that have finite ancestral set and are invariant to deleting nodes of the network. 
We then partition them into 21 classes \potentiallyImprove{with ancestral sets that are closed under node marginalization.} 
%that are closed under node marginalization.  
% 
Several of these classes are remarkably %\potentiallyRemove{allow for} 
amenable to parallelization.  
% ***
% \potentiallyImprove{To show case of our framework, we propose a simple model that} 
\potentiallyImprove{As an example, 
we 
highlight 
% propose 
a simple model that} 
% \potentiallyImprove{One in particular} 
exhibits flexible power-law degree distributions and emergent phase transitions in sparsity, 
which we characterize analytically.  
With few parameters and much conditional independence, 
% the 
\potentiallyImprove{our} 
proposed framework provides natural baseline models for causal inference in relational data. 
% % *** intervention and counterfactual?? 
% % Moreover, the proposed models provide a natural framework for addressing causal questions in relational data. 
% More broadly, the proposed models provide a natural baseline framework for causal inference in relational data, both local (intervening on individual relations) and global (intervening on parameters). 
% ** opens up...
\end{abstract}
\end{comment}
\begin{comment}
\begin{abstract}
% In this work, we provide a taxonomy for causal graphical models over dyads between a growing set of nodes.  
% We built a framework to describe natural causal models over pairs of nodes  in a growing network. % between a growing set of nodes. 
We propose a framework describing the classes of causal models over pairs of nodes (dyads) in a growing network,
whose causal ordering remains invariant upon interventions on 
%the nodes and edges 
the growing network. 
% In this work, we provide a framework describing natural, causal models  causal graphical models over dyads between a growing set of nodes.  
By systematically searching the space of possible causal models for growing networks, 
we find statistically-streamlined models for growing networks that easily exhibit many of the emergent features characteristic of real-world networks.   
% \todo{PPA and analyrr}
% In particular, we propose a parallelelized preferential attachment model, 
In particular, we propose a simple model --- parallelized preferential attachment --- and analytically characterize its asymptotic degree distribution.  
Neatly, with only 4 free parameters, this model already exhibits easily controllable phase transitions: 
% having three sparsity regimes: 
% controlled by $2$ of the parameters: 
a sparse regime with power-law degree distribution and constant average degree;
an intermediate regime with logarithmic average degree;  and 
a dense regime with polynomial degree distribution. 
% Moreover, the proposed models provide a natural framework for addressing causal questions in relational data. 
More broadly, the proposed models provide a natural baseline framework for causal inference in relational data, both local (intervening on individual relations) and global (intervening on parameters). 
% ** opens up...
\end{abstract}
\end{comment}