\section{Streamlining Network Models}
% \section{A Simple and Flexible Model}
\label{sec:parttwoppa}
\spacebefsection
%
% So far we have defined a convenient family of causal meta-DAGs between the dyads of the growing network. 
Our classification of invariant causal meta-DAGs 
% gives 
imposes 
requirements on where causal arrows can appear between the dyad variables, 
but so far we have said nothing about what the structural equations of a growing network might be. 
% The requirements imposed in the network model are still quite mild;  
% flexible 
\potentiallyImproveN{
% To determine the outcome of each dyad, 
Indeed, one \textit{could} use a different function for each dyad
% that depends in a complex way on all of its parents, 
with a complex dependency on all of its parents, 
and such a model would still generate network distributions that are faithful to the causal structure of our meta-DAGs.}  
% One \textit{could} use different functions for each dyad, depending in a complex way on all of its parents, 
% to determine its outcome and still generate distributions that are faithful to our meta-DAGs. 


% But this wouldn't quite be in the spirit of our framework. 
However, in the spirit of our framework of ``invariance of causal mechanisms'', 
we propose using the \textit{same} structural equation for all dyad variables in the model.  
Since the number of parents depends on the position of the child dyad variable, 
such a function must allow for an arbitrary number of inputs.  
\potentiallyImproveN{A natural choice is to define the function in terms of 
% various 
summary statistics of the different types of causal parents.} 
% \potentiallyImprove{
% %Such a function would still need to have a flexible number of inputs, 
% A natural choice is to use basic summary statistics of the different types of causal parents.} 
% As such a function would still need to have a flexible number of inputs, 
% a natural choice is to use basic summary statistics of the different types of causal parents.  
% Instead, we suggest considering structural equations for the dyads given by a function that is invariant to permutations within the classes of causal arrows parents. 
% 
% \spaceendsubsection
% % \subsection{Our {\ppaName} Model}
% % \subsection{{\ppaName} Model}
% \subsection{Our {\ppaAcron} Model}
% \label{sec:ourppamodel}
% \spacebefsubsection

In Section~\ref{sec:ourppamodel}, 
we present a simple model for binary (edge or no-edge) dyad variables  
% (edge, represented by a $1$, 
% % \mbox{$X_{ij} = 1$}, 
% and a no-edge by $0$)
% \mbox{$X_{ij} = 0$})
that implements preferential attachment using the $\HubName$ and $\PathName$ causal arrows.  %\novo{, generating (simple undirected) growing networks with controllable power-law degree distributions} (Section~\ref{sec:ourppamodel}).  
% While many variations of preferential attachment already exist, 
% the authors feel it worth
More important than the model itself is the rather surprising insight it provides:
\textit{reducing} dependencies between the dyad variables leads to \textit{increased} diversity in 
the 
asymptotic behavior of the growing network.  
\spaceendsection
\subsection{Basic Preferential Attachment}
% \subsection{Context: Preferential Attachment}
% \subsection{Preferential Attachment (Mise-en-sc\'ene)}%Parallelized Power Laws
% \subsection{Mise-en-sc\'ene: Preferential Attachment}
\label{sec:prefattachmentcontext}
\spacebefsubsection
% Our framework naturally contains (generalizations of) well-known models.  
While the relationship between preferential growth and scale-free distributions had 
already been described by several authors \citep{eggenberger1923statistik,simon1955class,price1965networks},  
% long 
% , barabasi2012luck}, 
% the effect itself is perhaps best exemplified by the citation count of %overwhelming number of citations garnered by 
the effect itself is perhaps best exemplified by the overwhelming number of citations garnered by 
% Barab\'asi and Albert 
\citet{barabasi1999emergence}. %, 
% whose already impressive number of citations will be incremented by this very mention.  
% making the mechanism widely known in 
% whose names have become synonymous with power-law degree distributions
% the network science community.  
% has garnered a truly impressive number of citations.  
% clearly demonstrated 
% % made it widely \potentiallyImprove{famous in}
% % known to 
% the network science community.  
% So much so that their names have become synonymous with power-law degree distributions.  

The simplest statement of their model 
\citep{posfai2016network}
has a single parameter, $m$.  
Initialize the network with a clique of $m$ nodes. 
At each iteration, select $m$ existing nodes with probability proportional to their current degree, 
and add a new node connected to each of these selected nodes.  
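In symbols, a common way to state this rule (the notation $\Pi(i)$ and $d_k$ is introduced here for illustration and is not taken from the cited text) is that each draw selects an existing node $i$ with probability 
\begin{equation*}
    \Pi(i) = \frac{d_i}{\sum_{k} d_k},
\end{equation*}
where $d_k$ is the current degree of node $k$ and the sum runs over all existing nodes. 
Since every iteration adds exactly $m$ edges, the denominator grows by exactly $2m$ per iteration. 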
% with its edges 
% by 
% connecting it to these $m$ selected nodes.} 
% nd a new node is
% added to the graph with its edges connecting to these m
% selected nodes.

Many extensions to this model have been described 
% \citep{posfai2016network}. 
%\citep{posfai2016network, ray2024stochasticthesis}.
\citep{ray2024stochasticthesis}: 
% \potentiallyImprove{Such as 
introducing parameters to control correlations between neighboring degrees \citep{avin2020mixed} or to promote clustering \citep{eikmeier2019triangle}; 
adding node covariates \citep{bianconi2001competition, lee2015preferential}; 
and considering alternative attachment functions  \citep{krapivsky2000connectivity}.   
% Nonlinear functions of the degrees \citep{krapivsky2000connectivity}.  
% Parameters can be introduced to control correlations between neighboring degrees \citep{avin2020mixed} or promote clustering \citep{eikmeier2019triangle}.  
% Node covariates can be used to introduce inhomogeneity \citep{bianconi2001competition, lee2015preferential}. 
% \citep{bianconi2001competition, lee2015preferential,bollobas2001degree}.  
Here, we draw attention to a particular line of modifications that reduce the \potentiallyImproveN{statistical} correlations between pairs of edges \citep{bollobas2007phase,wang2020directed}.  
\spaceendsubsubsection
\subsubsection{Poissonified Preferential Attachment} % (Other Models)}
\label{sec:theirsppamodel}
\spacebefsubsubsection
% In most formulations of a preferential attachment model, 
% there is a fixed parameter $m$ specifying the number of edges that each new node $j$ makes with the previous nodes. 
\potentiallyImproveN{Most models of preferential attachment have a  
% fixed 
parameter $m$, 
% specifying the number of edges that each new node $j$ makes with the previous nodes. 
specifying the precise number of edges that each new node $j$ makes with the previous nodes.  
This results in a small anticorrelation between the dyad variables \dyadpair{\BoxNodeS{}}{j} within each iteration.} 
% (\ie, dyads containing node $j$ and a previous node $\BoxNodeS{}<j$) 
% \citep{ray2024stochasticthesis}.} % (about \bigO{\tfrac{1}{n}}.  
%*** add our picture showing the correlations!!
%
% \citep{van2024random, wang2020directed}
Alternatively, one could assign each candidate edge a probability proportional to the degree of the existing node, scaled such that $m$ edges \potentiallyImproveN{are} 
added in expectation \citep{van2024random}.\footnote{Setting aside cases with probabilities greater than $1$. }
% : \mbox{\smash{$\tfrac{d_i}{\sum_i d_i} > \tfrac{1}{m}$}}  , which corresponds to a probability greater than $1$ for connecting to node $i$ (which can happen if \mbox{$m >2$}). }
% Such cases are unlikely to happen, but are technically possible when \mbox{$m >2$}.}
% \footnote{Setting aside cases where this cannot be done: \mbox{$\tfrac{d_i}{\sum_i d_i} > \tfrac{1}{m}$}, which gives a probability greater than 1.}
% By allowing the number of edges added to fluctuate appropriately, 
% the anticorrelation in the previous model can be removed 
% without qualitatively changing the asymptotic behavior; 
% the degree distribution has the same power-law tail \mbox{$p(d)\propto d^{-3}$} 
% and average degree \mbox{$\langle d\rangle = 2m$}.  
This does not qualitatively change the asymptotic behavior; 
the degree distribution has the same power-law tail \mbox{$p(d)\propto d^{-3}$} 
and average degree \mbox{$\langle d\rangle = 2m$} as before.  
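Concretely, one minimal way to write such a variant (the symbol $q_{ij}$ is introduced here only for illustration, and we assume all $q_{ij}\leq 1$) is to add the edge between the new node $j$ and each previous node $i$ independently with probability 
\begin{equation*}
    q_{ij} = \frac{m\, d_i}{\sum_{k<j} d_k}, 
    \qquad\text{so that}\qquad 
    \sum_{i<j} q_{ij} = m,
\end{equation*}
where $d_k$ is the degree of node $k$ just before node $j$ arrives. 
The normalization by the total degree $\sum_{k<j} d_k$ is what fixes the expected number of new edges at $m$. 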
%\citep{ray2024stochasticthesis}.  
% \commentToDo{*** \citep{jiao2015minimax} for generalizations of the multinomial-poison trick}
% 
% This idea is similar to that of ``poissonifying'' a multinomial distribution.  
% With the total number of counts fixed, the individual counts are anti-correlated.  
% By allowing the total number of items to fluctuate appropriately, 
% one is able to render the individual counts independent.  

By allowing the number of edges to be an implicit random variable, 
this modification renders the dyad variables \dyadpair{\BoxNodeS{}}{j} conditionally independent 
given the current degrees of the previous nodes.  
However, their outcomes still depend on the entire network up to the previous iteration.  
This is because the edge probabilities are normalized by the total degree of the network so that they sum to $m$.  
Is this dependence on all previous dyad variables necessary? 
How much dependence can we remove  
while still retaining the hallmark features of preferential attachment models? 
% the \novo{essential feature of displaying a power-law degree distribution?} 
% That is, how much ``inessential dependence'' can we remove from a preferential attachment model 
% while still retaining the ``essential features'', such as a power-law degree distribution?  
% With this modification, now the edges are conditionally independent (given the current degrees).  
% However, the probability of each edge still depends on the entire graph up to the previous iteration.  
\spaceendsubsection
\subsection{{\ppaName}}
\label{sec:ourppamodel}
\spacebefsubsection
\potentiallyImprove{Motivated by this question, 
we note that preferential attachment essentially calls for only two of the seven 
types of causal arrows: 
$\HubName$ and $\PathName$, 
and that their transitive closure implies 
significantly less dependence on previous dyad variables (see Fig.~\ref{fig:PosetMetaDAG}). 
% What if the probability of an edge were to depend \textit{only} on the degree statistics of the older node to which it might connect, 
% with zero dependence on the degrees of the other nodes in the network?  
% This would require only the $\HubName$ and $\PathName$ causal arrow \potentiallyImprove{(see causal meta-DAG in Fig.~\ref{Fig:ExampleGraphicalModelLocalDirectSeveral} \textit{top-left}).} 
As summary statistics for the structural equation, 
we use the degrees of the older node $i$: 
$d_i^{\text{in}}$ corresponds to $\HubName$, 
and $d_i^{\text{out}}$ corresponds to $\PathName$.}  
% We then consider an instantiation of preferential attachment: 
% We take the edge probabilities to be an affine function of the degree statistics corresponding to these two types of causal arrows, leading to the model we call 
We take the edge probabilities to be an affine function of these statistics, 
leading to the model we call 
\emphWord{{\mbox{\ppaName}}}, or \emphWord{\mbox{\ppaAcron}}, 
\potentiallyImprove{as sampling from it is highly parallelizable (see discussion in Section~\ref{sec:discussion}).}  % evaluation than many other preferential attachment models
%
% Explicitly, we have  the  
% following probabilistic model for the edges of a growing network:
Explicitly:
% \vspace{-1pt}
\begin{align}
    x_{ij}^{ } &\sim \text{Bernoulli}\big(p_{ij}^{ }\big) \label{eq:PPAbernoulli} \\
    p_{ij}^{ } &= \frac{\alphaP + \thetain d_i^{\text{in}} + \thetaout d_i^{\text{out}}}{j-2+\alphaP +\betaP} \label{eq:PPAprob}\\
    d_i^{\text{in}} &= \sum_{\BoxNodeSS{}=i+1}^{j-1} x_{i\BoxNodeSS{}}^{ }  \quad\quad
    d_i^{\text{out}} = \sum_{\BoxNodeSS{}=1}^{i-1} x_{\BoxNodeSS{} i}^{ } \label{eq:PPAdout}    
\end{align}
where \mbox{$x_{ij} = 1$} indicates an edge between nodes $i$ and $j$ (and \mbox{$x_{ij} = 0$} indicates no edge). 
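As a small worked instance of these definitions (the edge values below are hypothetical, chosen only for illustration), take $j=4$ and $i=2$ with $x_{12}=1$, $x_{13}=0$, and $x_{23}=1$. Then 
\begin{align*}
    d_2^{\text{in}} &= x_{23} = 1, \qquad d_2^{\text{out}} = x_{12} = 1, \\
    p_{24} &= \frac{\alphaP + \thetain + \thetaout}{2 + \alphaP + \betaP}.
\end{align*}
Note that $x_{13}$ does not enter $p_{24}$: only dyads that contain node $2$ contribute. 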

%%%******
% The parameters $\thetain$ and $\thetaout$ correspond to $\HubName$ and $\PathName$ causal arrows, respectively. 
% If one uses the ordering of the nodes to orient the edges of the growing network, 
% {\hub} arrows refer to the \mbox{in-degree} 
% of the node $\BoxNodeS{}$ of the dyad \mbox{\dyadpair{\BoxNodeS{}}{j}} being decided 
% (\ie, dyads of the type \mbox{\dyadpair{\BoxNodeS{}}{i}}, where \mbox{$\BoxNodeS{}<i$}). 
% Whereas {\transitive} arrows refer to the \mbox{out-degree} of the node $\BoxNodeS{}$ 
% (\ie, dyads of the type \mbox{\dyadpair{i}{\BoxNodeS{}}}, where \mbox{$i<\BoxNodeS{}$}). 
% \footnote{Albeit not necessary, 
% due to the ordering of the nodes in the growing network,
% the presence of a dyad (\ie, an edge) can be thought as an arrow with the newer node as the tail and the older node as the head, or likewise as a direct edge from the newer node to the older node. 
% {\hub} and {\transitive} describe how dyads with different tail nodes (\ie, newer nodes) can directly influence each other, 
% and the case where the dyads that have the same tail node directly influence each other is described by {\forward} \textit{or} {\backward}.\label{footnotedirection}} 

% \novo{Note that this model has a remarkable degree of parallelization: it can be generated $\mathcal{O}(n)$ in time, where $n$ is the number of nodes in the network, in contrast to the $\mathcal{O}(n^2)$ time need .}
% This leads to a remarkable degree of parallelizability 
Surprisingly, while drastically \textit{decreasing the dependence} between the dyads, 
this model exhibits \textit{increased diversity} in its asymptotic behavior 
% (see Theorem~\ref{thm:PPAPhaseTransition} and Fig.~\ref{fig:PPARegimes}).  
(see Fig.~\ref{fig:PPARegimes} and Theorems~\ref{thm:PPAPhaseTransition} and \ref{thm:PPAPhaseTransitiondegdist}; proofs are given in Appendix~\ref{appendixDAPAProofs}).
\begin{figure}[h]
    \centering
    \begin{tikzpicture}[remember picture]
        % \tikzmath{\SquareSize=5.625;}
        \tikzmath{\SquareSize=6.0;}
        \tikzmath{\BottomRightX=0.0;}
        \tikzmath{\BottomRightY={-\SquareSize};}
        \tikzmath{\UpperLeftX={\BottomRightX+\SquareSize};}
        \tikzmath{\UpperLeftY={\BottomRightY+\SquareSize};}
        \tikzmath{\BoxThickness=2;}
        \tikzmath{\DiagThickness=1;}
    

    
        %%% BOX %%%
        \draw [line width=\BoxThickness] (\BottomRightX,\BottomRightY) -- (\UpperLeftX,\BottomRightY) -- (\UpperLeftX,\UpperLeftY) -- (\BottomRightX,\UpperLeftY) -- (\BottomRightX,\BottomRightY) -- (\UpperLeftX,\BottomRightY);
    
        %%% Diagonal %%%
        \draw [line width=\DiagThickness,dotted] (\UpperLeftX,\BottomRightY) -- (\BottomRightX,\UpperLeftY);
    
        %%% Parameters %%%
        \tikzmath{\offset=0.5;}
        \tikzmath{\X=\BottomRightX+\SquareSize/2;}
        \tikzmath{\Y=\BottomRightY-\offset;}
        \node[text=black,rotate=0,anchor=base] at (\X,\Y) {$\HubName$ \kern1pt strength parameter \kern1pt $\thetain$};
        \node[text=black,rotate=0,anchor=base] at (\BottomRightX,\Y) {$0$};
        \node[text=black,rotate=0,anchor=base] at (\UpperLeftX,\Y) {$1$};
    
        \tikzmath{\offset=0.45;}
        \tikzmath{\X=\BottomRightX-\offset;}
        \tikzmath{\Y=\BottomRightY+\SquareSize/2;}
        \node[text=black,rotate=90] at (\X,\Y) {$\PathName$ \kern1pt strength parameter \kern1pt $\thetaout$};
        \node[text=black,rotate=0] at (\X,\BottomRightY) {$0$};
        \node[text=black,rotate=0] at (\X,\UpperLeftY) {$1$};
    
    
        %%% Diag text %%%
        \def\SpacingObject{\vphantom{\big(}}
        % \tikzmath{\x=0.6;\y=-0.6;}
        \tikzmath{\x=1.0;\y=-1.0;}
        \node[text=\logColor,rotate=-45,anchor=south west] at (\x-0.05,\y-0.05) {\SpacingObject\textbf{\uline{log avg degree}} \kern6pt \textbf{\mbox{$\thetain+\thetaout=1$}}};
        \node[text=\logColor,rotate=-45,anchor=north west] at (\x+0.05,\y+0.05) {\SpacingObject\textbf{$\langle d \rangle$ $\rightarrow$ $2\kern1pt \alphaP\ln(n) + C$}};
        \tikzmath{\CenterX=\BottomRightX+\SquareSize/2;}
        \tikzmath{\CenterY=\BottomRightY+\SquareSize/2;}

        %%% UPPER RIGHT TEXT
        \tikzmath{\x=\SquareSize-0.05;\y=0.0;}
        % \node[text=\polyColor,rotate=0,anchor=north east] at (\x,\y) 
        %     {\SpacingObject\textbf{poly avg degree} };
        % \tikzmath{\x=\x;\y=\y-0.5;}
        % \node[text=\polyColor,rotate=0,anchor=north east] at (\x,\y) 
        %     {\SpacingObject\textbf{\mbox{$1<\thetain+\thetaout<2$}} };
        % \tikzmath{\x=\x;\y=\y-0.5;}
        \node[text=\polyColor,rotate=0,anchor=north east] at (\x,\y) 
            {\SpacingObject\textbf{\uline{poly avg degree}} };
        \tikzmath{\x=\x;\y=\y-0.5;}
        \node[text=\polyColor,rotate=0,anchor=north east] at (\x,\y) {\SpacingObject\textbf{\mbox{$\langle d\rangle\rightarrow C\times n^{\rho}_{ }$}}};
        % \node[text=\polyColor,rotate=0,anchor=north east] at (\x,\y) {\SpacingObject\textbf{\mbox{$\langle d\rangle\rightarrow C\times n^{\rho}_{ } \qquad p\big(d\big)  \propto  d^{-\gamma}$}}};
        \tikzmath{\x=\x;\y=\y-0.5;}
        \node[text=\polyColor,rotate=0,anchor=north east] at (\x,\y) 
            {\SpacingObject\textbf{\mbox{$\rho=\thetain+\thetaout-1$}}};
        % \tikzmath{\x=\x;\y=\y-0.5;}
        % \node[text=\constColor,rotate=0,anchor=north east] at (\x,\y) {\textbf{$p\big(d\big)  \propto  d^{-\gamma}$}};
        \tikzmath{\x=\x;\y=\y-0.5;}
        \node[text=\constColor,rotate=0,anchor=north east] at (\x,\y) {\textbf{$\gamma = \displaystyle\frac{2-\thetaout}{1-\thetaout}$}};

    
        % \tikzmath{\x=\SquareSize-2.7;\y=-0.4;}
        % \tikzmath{\y=\y-2.35;}
        % \renewcommand{\eqspace}{\kern2pt}
        % \node[text=\constColor,rotate=0] at (\x-1.52,\y-0.37) {\textbf{const avg degree}};
        % \node[text=\constColor,rotate=0] at (\x-1.62,\y-0.77) {\textbf{(\mbox{$\thetain+\thetaout<1$})}};
        % \node[text=\constColor,rotate=0] at (\x-0.52,\y-1.9) {\textbf{$\begin{aligned}
        % \langle d \rangle &= \displaystyle\frac{2\alpha}{1-\big(\thetain+\thetaout\big)}\\[-0.3em]
        % p\big(d\big)  &\propto  d^{-\gamma} \qquad \gamma  =  1+\displaystyle\frac{1}{\thetain}
        % \end{aligned}$}};
        % \tikzmath{\x=0.05;\y=-3.75;}
        \tikzmath{\x=0.05;\y=-3.65;}
        \renewcommand{\eqspace}{\kern2pt}
        % \node[text=\constColor,rotate=0,anchor=south west] at (\x,\y) {\textbf{const avg degree}};
        % \tikzmath{\x=\x;\y=\y-0.5;}
        % \node[text=\constColor,rotate=0,anchor=south west] at (\x,\y) {\textbf{\mbox{$0<\thetain+\thetaout<1$}}};
        % \tikzmath{\x=\x;\y=\y-0.5;}
        \tikzmath{\x=\x;\y=\y-0.5;}
        \node[text=\constColor,rotate=0,anchor=south west] at (\x,\y) {\textbf{\uline{const avg degree}}};
        \tikzmath{\x=\x;\y=\y-1.0;}
        \node[text=\constColor,rotate=0,anchor=south west] at (\x,\y) {\textbf{$\langle d \rangle \rightarrow \displaystyle\frac{2\kern1pt\alphaP}{1-\big(\thetain+\thetaout\big)}$}};
        % \tikzmath{\x=\x;\y=\y-0.75;}
        \tikzmath{\x=\x;\y=\y-0.85;}
        \node[text=\constColor,rotate=0,anchor=south west] at (\x,\y) {\textbf{$p\big(d\big)  \propto  d^{-\gamma} \qquad \gamma  =  \displaystyle\frac{1+\thetain}{\thetain}$}};
        % \node[text=\constColor,rotate=0,anchor=south west] at (\x,\y) {\textbf{$\begin{aligned}
        % \langle d \rangle &= \displaystyle\frac{2\alpha}{1-\big(\thetain+\thetaout\big)}\\[-0.3em]
        % p\big(d\big)  &\propto  d^{-\gamma} \qquad \gamma  =  1+\displaystyle\frac{1}{\thetain}
        % \end{aligned}$}};



        %       \tikzmath{\y=\y-2.35;}
        % \renewcommand{\eqspace}{\kern2pt}
        % \node[text=\constColor,rotate=0] at (\x-1.52,\y-0.37) {\textbf{const avg degree}};
        % \node[text=\constColor,rotate=0] at (\x-1.62,\y-0.77) {\textbf{(\mbox{$\thetain+\thetaout<1$})}};
        % \node[text=\constColor,rotate=0] at (\x-0.52,\y-1.95) {\textbf{$\begin{aligned}
        % \langle d \rangle &= \eqspace \displaystyle\frac{2 \eqspace\alpha}{1-\big(\thetain+\thetaout\big)}\\[-0.3em]
        % p\big(d\big) \eqspace &\propto \eqspace d^{-\gamma} \qquad \gamma \eqspace = \eqspace 1+\displaystyle\frac{1}{\thetain}
        % \end{aligned}$}};

 % \node[text=\constColor,rotate=0] at (\x-1.75,\y-2.25) {$\begin{aligned}
 %        \langle d \rangle &= \eqspace \displaystyle\frac{2 \eqspace\alpha}{1-\big(\thetain+\thetaout\big)}\\[-0.35em]
 %        p\big(d\big) \eqspace &\propto \eqspace d^{-\gamma} \qquad \gamma \eqspace = \eqspace 1+\displaystyle\frac{1}{\thetain}
    
    \end{tikzpicture}
    \vspace{-5pt}
    \caption{\textbf{Sparsity and power-laws in the {\ppaAcron} model.} %\\
    % \textbf{Sparsity regimes for our {\ppaName} ({\ppaAcron}) model.}
    }
    \label{fig:PPARegimes}
\end{figure}
\spaceendsubsubsection
\subsubsection{Three Sparsity Regimes}
\spacebefsubsubsection
% There are three regimes for the average degree 
% This model has three regimes for the asymptotic ($n\longrightarrow\infty$) average degree {\avgdegree}, 
% While the limit of (growing) dense graphs is well-understood, 
% the limit
% Networks with power-law degree distributions are naturally sparse;  
% in order for the degrees to span arbitrarily many orders of magnitude, 
% ``most'' nodes have a vanishingly small degree compared to the maximum.  
% in the sense that the fraction of dyad variables that are edges limits to zero as the network grows in size: $E(n)/\binom{n}{2}=\littleO{1}$. 
% A sparse growing graph is defined asymptotically; 
% sparse graphs have fewer edges than a dense graph (in a strict asymptotic, \ie, the total number of edges $E(n)$ is $\littleO{n^2}$).  
Sparsity can be characterized in terms of the average degree as a function of the number of nodes $n$: ${\avgdegree = \frac{2E(n)}{n}}$, where $E(n)$ is the total number of edges.  
For dense networks, the average degree grows linearly in the number of nodes: ${\avgdegree = \bigO{n}}$, 
while for sparse networks, the average degree grows sublinearly, ${\avgdegree = \littleO{n}}$ \citep{van2024random}. 
% \citep{van2024random,bollobas2001degree}.  
%
Networks with power-law degree distributions are naturally sparse;  
in order for the degrees to span arbitrarily many orders of magnitude, 
``most'' nodes have a vanishingly small degree compared to the maximum.  
%*** this feels a bit disconnected/doesnt flow as well

Despite 
% its 
the apparent similarity of our {\ppaAcron} model to preferential attachment models that ``hard-code'' the average degree, % ${\avgdegree = \bigO{1}}$, 
our model exhibits an emergent phase transition to sparse scalings with growing average degree. 
% (see Fig.~\ref{fig:PPARegimes}, 
% and proof in Appendix~\ref{appendixDAPAProofs}
% \novo{all pro}
% **** footnote saying that if thetas are zero and alpha and beta 1 we get ER?  
\begin{theorem}[\thmNamePPA]
\label{thm:PPAPhaseTransition}
    The {\ppaAcron} model exhibits three qualitatively different asymptotic behaviors for the average degree $\avgdegree$. \\
    % \\[-12pt] %, 
    % depending on the value of ${\thetain+\thetaout}$ 
    % (see Fig.~\ref{fig:PPARegimes}, proof in Appendix~\ref{appendixProof3Regimes}):\\[-12pt]
    %Our {\ppaName} ({\ppaAcron}) model (\cref{eq:PPAprob,eq:PPAdin,eq:PPAdout}). 
    \def\TempKern{\kern4pt}
\begin{tabular}{llr}
    \vphantom{\Bigg(}constant:      &\TempKern$\displaystyle\frac{2\kern1pt\alphaP}{1-\big(\thetain+\thetaout\big)}$     &\TempKern$0<\thetain+\thetaout<1$ \\
    \vphantom{\Bigg(}logarithmic:   &\TempKern$2\kern1pt\alphaP \log\big(n\big) + C$                            &\TempKern$\thetain+\thetaout=1$ \\
    \vphantom{\Bigg(}polynomial:    &\TempKern$C \times n^{\thetain+\thetaout-1}$                     &\TempKern$1<\thetain+\thetaout<2$
\end{tabular}
\end{theorem}
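The three regimes can be anticipated from a heuristic mean-field calculation, which we sketch here under simplifying assumptions (fluctuations are ignored, the node index is treated as continuous, and all $p_{ij}\leq 1$); it is an illustration only and not a substitute for the proof in Appendix~\ref{appendixDAPAProofs}. 
Let $e_j$ denote the expected number of edges among the first $j$ nodes, and write $\theta = \thetain+\thetaout$ (both symbols are introduced only for this sketch). 
Each existing edge is counted once in $\sum_{i<j} d_i^{\text{in}}$ and once in $\sum_{i<j} d_i^{\text{out}}$, so summing Eq.~\eqref{eq:PPAprob} over $i<j$ and taking expectations gives 
\begin{align*}
    e_j - e_{j-1} &= \frac{(j-1)\,\alphaP + \theta\, e_{j-1}}{j-2+\alphaP+\betaP} 
    \;\approx\; \alphaP + \frac{\theta\, e_{j-1}}{j}, \\
    \frac{\mathrm{d}e}{\mathrm{d}j} &\approx \alphaP + \frac{\theta\, e}{j} 
    \quad\Longrightarrow\quad 
    e(j) \approx 
    \begin{cases}
        \dfrac{\alphaP}{1-\theta}\, j + C j^{\theta}, & \theta \neq 1, \\[0.6em]
        \alphaP\, j \ln j + C j, & \theta = 1.
    \end{cases}
\end{align*}
Dividing $2e(n)$ by $n$ then suggests a constant $2\alphaP/(1-\theta)$ for $\theta<1$ (the linear term dominates), a logarithm $2\alphaP\ln n$ plus a constant for $\theta=1$, and a power law proportional to $n^{\theta-1}$ for $\theta>1$ (the $C j^{\theta}$ term dominates). 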
% \vspace{-6pt}
% \begin{tabular}{lr}
    % Constant: ${\avgdegree = \bigTh{1}}$                            & ${\thetain+\thetaout<1}$ \\
    % Logarithmic: ${\avgdegree = \bigTh{\log(n)}}$                   & ${\thetain+\thetaout=1}$ \\
    % Polynomial: ${\avgdegree = \bigTh{n^{\thetain+\thetaout-1}}}$   & ${\thetain+\thetaout>1}$
    % \vphantom{\Bigg(}$\avgdegree \longrightarrow \displaystyle\frac{2\alpha}{1-\thetain-\thetaout}$     & $\quad 0<\thetain+\thetaout<1$ \\
    % \vphantom{\Bigg(}$\avgdegree \longrightarrow \alpha \log\big(n\big) + C$                            & $\quad \thetain+\thetaout=1$ \\
    % \vphantom{\Bigg(}$\avgdegree \longrightarrow C \times n^{\thetain+\thetaout-1}$                     & $\quad 1<\thetain+\thetaout<2$
% \def\TempKern{\kern4pt}
% \begin{tabular}{llr}
%     \vphantom{\Bigg(}constant:      &\TempKern$\displaystyle\frac{2\kern1pt\alphaP}{1-\big(\thetain+\thetaout\big)}$     &\TempKern$0<\thetain+\thetaout<1$ \\
%     \vphantom{\Bigg(}logarithmic:   &\TempKern$2\kern1pt\alphaP \log\big(n\big) + C$                            &\TempKern$\thetain+\thetaout=1$ \\
%     \vphantom{\Bigg(}polynomial:    &\TempKern$C \times n^{\thetain+\thetaout-1}$                     &\TempKern$1<\thetain+\thetaout<2$
% \end{tabular}
\spaceendsubsubsection
\subsubsection{And a Flexible Power-law Degree Distribution}
\label{sec:powerlawflexible}
\spacebefsubsubsection
For all three sparsity regimes, 
the degree distributions of the growing networks have \potentiallyImprove{a range of power-law scalings.}
\begin{theorem}[\thmNamePPAdegdist]
\label{thm:PPAPhaseTransitiondegdist}
    % Our {\ppaAcron} model exhibits three qualitatively different asymptotic behaviors for the average degree $\avgdegree$ \\[-12pt] %, 
   % Our {\ppaAcron} model exhibits three qualitatively different regimes for 
   The asymptotic probability that a random node \potentiallyImprove{has degree $d$ has a tail} of the form \mbox{$p(d) \propto d^{-\gamma}$}, where the scaling exponent depends on either $\thetain$ or $\thetaout$. \\
   \def\TempKern{\kern4pt}
   % \begin{tabular}{llr}
    % \vphantom{\Bigg(}\text{constant:}      &\TempKern$\displaystyle\gamma = \frac{1+\thetain}{\thetain}$     &\TempKern$0<\thetain+\thetaout\leq1$ \\
    % \vphantom{\Bigg(}polynomial:    &\TempKern$\displaystyle\gamma = \frac{2-\thetaout}{1-\thetaout}$                     &\TempKern$1\leq\thetain+\thetaout<2$
    \begin{tabular}{llr}
    \vphantom{\Bigg(}\text{constant:}      &\TempKern$\displaystyle\gamma = \frac{1+\thetain}{\thetain}$     &\TempKern$0<\thetain+\thetaout\leq1$ \\
    \vphantom{\Bigg(}polynomial:    &\TempKern$\displaystyle\gamma = \frac{2-\thetaout}{1-\thetaout}$                     &\TempKern$1\leq\thetain+\thetaout<2$
\end{tabular}
\end{theorem}
Note that these two expressions result in the same scaling exponent precisely when \mbox{$\thetain+\thetaout=1$} (equating them yields \mbox{$\thetain = 1-\thetaout$}), so the two cases agree on their shared boundary.  
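As an arithmetic check of the stated formulas (the parameter values are chosen only for illustration), the symmetric choice \mbox{$\thetain=\thetaout=\tfrac{1}{2}$} gives 
\begin{equation*}
    \gamma = \frac{1+\tfrac{1}{2}}{\tfrac{1}{2}} = \frac{2-\tfrac{1}{2}}{1-\tfrac{1}{2}} = 3,
\end{equation*}
which is the same tail exponent as the \mbox{$p(d)\propto d^{-3}$} of basic preferential attachment (Section~\ref{sec:prefattachmentcontext}). 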

\commentToDo{new citations from reviewer discussion: 
\citep{dereich2013random, sheridan2008preferential,wang2023poisson}}