\spaceendsubsubsection
% \section{Applications} % and Extensions}
\section{Some Applications} % and Extensions}
\label{sec:extensions}
\spacebefsection
\novo{In this section, 
we discuss 
% two advantages of our framework; 
how having causal DAGs as generative models for the networks leads to two advantageous features of our framework:  
ease of sampling and inference (\Cref{sec:inference}), and ability to answer interventional and counterfactual causal queries (\Cref{sec:deletion}).}
\spaceendsection
% \subsection{Extrapolating from small growing networks}%inference and generalization
\subsection{Inference and Generalization}%inference and generalization
\label{sec:inference}
\spacebefsection
This flexibility of asymptotic behaviors from a simple model is a useful property for extrapolating from limited data.  
For instance, 
consider observing a growing network that is still in its relative infancy.  
\potentiallyImproveN{The average degree is increasing as a function of the number of nodes, but it is slowing down; 
will it converge to some constant value, or if not, at what rate might it increase?}  
The degree distribution is currently more spread than a network with independent edges, 
 but there is not yet a region that looks linear on a log-log plot;\footnote{\potentiallyImproveN{Estimating the power-law exponent of a degree distribution is notoriously tricky \citep{clauset2009power}.  %***check other citations  \citep{clauset2009power,artico2020rare}. 
No finite network is truly scale-free; 
even if there is an obvious power-law that fits the majority of the degree distribution, 
there are necessarily deviations at the extremities.}}  
% there are typically significant deviations from it in the frequency of nodes with small degrees.}  
% } 
what might its scaling exponent be once many more nodes are added?  

% Estimating the power-law exponent of a degree distribution is notoriously tricky \citep{clauset2009power,artico2020rare}.  %***check other citations  \citep{clauset2009power,artico2020rare}. 
% No finite network is truly scale-free; 
% even if there is an obvious power-law that fits the majority of the degree distribution, 
% there are necessarily deviations at the extremities.
By fitting the parameters of a simple structural equation to initial observations, 
one might still be able to predict the %estimate such quantities related to 
% its 
the asymptotic behavior of the growing network, 
despite those features not yet being present.  % ***
% even though those features are not present yet.
%
% \potentiallyImprove{
% That is, by quantitatively estimating the causal mechanisms, 
% instead of fitting  distribution itself, 
% it is possible to make more reasonable predictions about what might happen in the future.}  
%
%
% Moreover, many growing networks that \textit{would} develop a power-law degree distributions 
% are simply not large enough to exhibit an obvious exponent.  
\spaceendsubsection 
% estimating the effect of 
\subsection{Interventions and Counterfactuals}
\label{sec:interventionandcounterfactuals}
\spacebefsubsection
\potentiallyImprove{
% In this section, 
We now
illustrate  
how 
% our framework can be used 
to use our framework 
to answer interventional and counterfactual causal queries % \citep{pearl2009causality} 
using our beloved running examples.} 
%


Suppose you are about to submit a publication, 
and you want to add a few more citations to your bibliography to help it reach a larger audience.  
To estimate the net effect of \potentiallyImprove{this strategy,}
% \potentiallyImprove{such strategic placement,}
% strategic citations, 
one could fit the parameters of 
% a causal model,
a \potentiallyImprove{causal model}  
% **** meta dag??
(such as the {\ppaAcron} model or its extensions)
to \novo{the current state of the citation network relevant to you.}  
% \potentiallyImprove{the current citation network.} 
% ***(citation network relevant to you? ego citation network?) 

By approximating the strength of various causal mechanisms, 
\potentiallyImprove{one can run the model forward to estimate the number of 
additional citations one might receive as a result.}  
% *** (more details!!)
This is an example of an \textit{interventional} question, 
as the answer involves quantifying (the result of performing an action) 
over a \textit{distribution} of possible futures.  

Now suppose you have an older publication that you really feel should have more citations, 
and you are deciding how much to regret not promoting it more \potentiallyImprove{at the time you published it.}  
This is an example of a \textit{counterfactual} question,  
as now the answer involves quantifying the difference between 
one \textit{particular} outcome (that was actually observed), 
and another (that \textit{could} have occurred, but did not).  
%

To estimate \potentiallyImprove{the net effect of such 
% fictional 
``could/should have been'' 
actions, 
one} can use the structural equations of \potentiallyImprove{a causal model.}   
For example, the randomness in Equation~\eqref{eq:PPAbernoulli} \potentiallyImprove{of the {\ppaAcron} model} can be represented explicitly by introducing an (unobserved) random variable: 
\begin{align}
    % \epsilon_{ij}^{ } &= \text{Uniform}[0,1] \\
       \epsilon_{ij}^{ } &\sim \mathcal{U}\textit{niform}\big(0,1\big) \\
    x_{ij}^{ } &= \text{sign}\big(p_{ij}^{ } - \epsilon_{ij}^{ }\big) \label{eq:PPASEM}
\end{align}
% In the counterfactual world, the $p_{ij}^{ }$ and $x_{ij}^{ }$ may change, but the 
% **** something here about pij fixed..
% From the estimated parameters of the model, and the specific set of observed results, 
% one can use this form to estimate the likelihood of such counterfactual changes.  
\potentiallyImprove{From the estimated parameters of the model, 
% and the specific set of observed results, 
% and the specific set of observed results, 
and the actual observed data, 
%you wish counterfactually query, 
one can use this form to estimate the likelihood of such counterfactual changes.}  % ***
\potentiallyImprove{Essentially, this involves performing bayesian updates 
% on 
to the $p_{ij}^{ }$ and $x_{ij}^{ }$, while treating the $\epsilon_{ij}^{ }$ as fixed \citep{pearl2009causality}.}  

% \spaceendsubsection
% \subsection{Incorporating Clustering}
% \label{sec:clustering}
% \spacebefsubsection

% \spaceendsubsection
% \subsection{A ``Bottom-up'' Causal Structure}
% \label{sec:bottomupcausalstructure}
% \spacebefsubsection
% \spaceendsubsection
\begin{comment}
    

\section{Related Work}
\label{sec:relatedwork}
\spacebefsection





% \citep{lauritzen1996graphical}
% by \citet{lauritzen2017exchangeability}.
% (Proposition 3 and Theorem 2 on their first paper and Theorem 4 in here). 
The initial inspiration for this work 
% come from a 
was a 
% neat 
paper by \citet{lauritzen2017exchangeability} connecting maximum entropy distributions over (undirected simple) graphs with $n$ nodes 
% (\ie, ERGMs), 
(a.k.a., ERGMs), 
% (\ie, exponential random graph models, or ERGMs) 
and graphical models describing the conditional independence structure of 
% their 
the ERGMs's 
${n \choose 2}$ dyads 
% random 
variables.  
% (Note that ERGMs are exchangeable over the nodes, but their random variables are the ${n \choose 2}$ dyads.) 
% So, they ask about what are the allowed conditional independent statements between the dyads, under the constraint that the distribution is exchangeable over nodes. 
They showed that there are only 6 different classes of graphical models and described the sufficient statistics of the associated ERGMs. 
(Note that ERGMs are exchangeable over the nodes, 
but their random variables are the 
% ${n \choose 2}$ 
dyads.) 
% the conditional independence between the ${n \choose 2}$ dyads variables 
% statements that are implied by graphical models.  
% in which
% In that paper in which the authors' characterize the types of independence structure between the ${n \choose 2}$ dyads variables of an Exponential Random Graph Models (ERGMs) for simple graphs with $n$ nodes. 
%
% which described the conditional independence structure of the dyads of exchangeable networks models over a finite number of nodes, that is, ERGMs. 
% Exponential random network graph models (ERGMs) over simple graphs with $n$ nodes. 
% We study conditional independence relationships for random networks and their interplay with exchangeability. We show that, for finitely exchangeable network models, the empirical subgraph densities are maximum likelihood estimates of their theoretical counterparts. We then characterize all possible Markov structures for finitely exchangeable random graphs, thereby identifying a new class of Markov network models corresponding to bidirected Kneser graphs. In particular, we demonstrate that the fundamental property of dissociatedness corresponds to a Markov property for exchangeable networks described by bidirected line graphs. Finally we study those exchangeable models that are also summarized in the sense that the probability of a network depends only on the degree distribution, and we identify a class of models that is dual to the Markov graphs of Frank and Strauss. Particular emphasis is placed on studying consistency properties of network models under the process of forming subnetworks and we show that the only consistent systems of Markov properties correspond to the empty graph, the bidirected line graph of the complete graph and the complete graph. 
%%
% A key insight of this work was to study invariance of 
% % the 
% causal mechanisms instead of node exchangeability, 
% connecting 
% the framework of 
% structural causal models (SCM) with 
% statistically attractive 
% generative models for growing networks.
%
\potentiallyImprove{
A key insight of this work is to impose invariance of 
the 
causal mechanisms 
% for generating the network 
instead of node exchangeability, 
thereby 
connecting 
the framework of 
structural causal models (SCM) with 
statistically attractive 
generative models for growing networks.
}
%**** 
% this work is to consider invariance of the causal mechanisms instead of node exchangeability. %**** 
\spaceendsection
\end{comment}