

We conduct numerical experiments to illustrate the finite-sample properties of estimators for the path-specific PNS.





\begin{table*}[!tb]
\centering
\caption{Results of numerical experiments.
We present the estimates of $\text{\normalfont T-PNS}$, $\text{\normalfont ND-PNS}^{{M}}$, $\text{\normalfont NI-PNS}^{{M}}$, $\text{\normalfont PNS}^{X \rightarrow Y}$, $\text{\normalfont PNS}^{X \rightarrow {N}  \rightarrow Y}$, $\text{\normalfont PNS}^{X \rightarrow {M} \rightarrow {N}  \rightarrow Y}$, and $\text{\normalfont PNS}^{X \rightarrow {M}  \rightarrow Y}$.
%along with their respective upper and lower bounds. \jin{bounds???}
Additionally, we report the mean of each estimator accompanied by their 95\% confidence interval.}
\label{tab:f}
%\vspace{-0.2cm}
\scalebox{1}{
\begin{tabular}{c|cccc}
\hline
Estimators & $N=20$ & $N=100$ & $N=10000$ &  Ground Truth \\
\hline
\hline
$\text{\normalfont T-PNS}$  & $0.447$ ($[0.286,0.625]$) & $0.448$ ($[0.385,0.519]$) & $0.449$ ($[0.443,0.455]$) &$0.449$ \\
$\text{\normalfont ND-PNS}^{{M}}$  & $0.153$ ($[0.040,0.343]$) & $0.155$ ($[0.103,0.214]$) & $0.156$ ($[0.150,0.161]$) &$0.156$ \\
$\text{\normalfont NI-PNS}^{{M}}$  & $0.296$ ($[0.154,0.443]$) & $0.293$ ($[0.231,0.355]$) & $0.293$ ($[0.282,0.299]$) &$0.293$ \\
$\text{\normalfont PNS}^{X \rightarrow Y}$  & $0.057$ ($[0.003,0.153]$) & $0.059$ ($[0.032,0.093]$) & $0.059$ ($[0.056,0.062]$) &$0.059$ \\
$\text{\normalfont PNS}^{X \rightarrow {N}  \rightarrow Y}$   & $0.095$ ($[0.015,0.251]$) &$0.097$ ($[0.059,0.144]$) &  $0.097$ ($[0.093,0.101]$) &$0.097$ \\
$\text{\normalfont PNS}^{X \rightarrow {M} \rightarrow {N}  \rightarrow Y}$   & $0.133$ ($[0.037,0.287]$) &$0.134$ ($[0.096,0.180]$)  &  $0.134$ ($[0.130,0.139]$) &$0.135$ \\
$\text{\normalfont PNS}^{X \rightarrow {M}  \rightarrow Y}$   & $0.163$ ($[0.031,0.319]$)&$0.160$ ($[0.108,0.215]$)  &  $0.158$ ($[0.153,0.164]$) &$0.158$ \\
\hline
\end{tabular}
}
\end{table*}


{\bf Estimation methods.}
The path-specific PNSs for decomposition under SCM ${\cal M}^{L2}$ are estimable using simple linear regressions.
%$Y:=\alpha_0+\alpha_1 X+\alpha_2 {M}+\alpha_3 {N}+\alpha_4 C+U^Y$, ${N}:=\beta_0+\beta_1 X+\beta_2 {M}+\beta_3 C+U^{{N}}$, ${M}:=\gamma_0+\gamma_1 X+\beta_3 C+U^{{M}}$, where $U^Y\sim {\cal N}(0,\sigma^2_Y)$, $U^{{M}} \sim {\cal N}(0,\sigma^2_{{M}})$, and $U^{{N}} \sim {\cal N}(0,\sigma^2_{{N}})$ are independent normal distributions.
%The counterfactuals $Y_{x,{M}_{x'},{N}_{x'',{M}_{x'''}}}$ follows
$\theta(y;x,x',x'',x''',c)$ in Theorem \ref{theo41} is estimated by $\mathbb{P}(Z<y)$,
%\begin{equation}
%\begin{aligned}
%&\alpha_0+\alpha_1 X+\alpha_2 (\gamma_0+\gamma_1 x'+\beta_3 C+U^{{M}})\\
%&+\alpha_3 (\beta_0+\beta_1 x''+\beta_2 (\gamma_0+\gamma_1 x'''+\beta_3 C+U^{{M}})\\
%&+\beta_3 C+U^{{N}})+\alpha_4 C+U^Y\\
%&\hat{\rho}(y;x,x',x'',x''',c)=\\
%&\mathbb{P}(\alpha_0+\alpha_1 x+\alpha_2 \gamma_0+\alpha_2 \gamma_1 x'+\alpha_2 \beta_3 c+\alpha_3 \beta_0\\
%&+\alpha_3 \beta_1 x'' 
%+\alpha_3 \beta_2 \gamma_0+\alpha_3 \beta_2 \gamma_1 x'''+\alpha_3 \beta_2 \beta_3 c\\
%& + \alpha_3 \beta_2 \beta_3 c+\alpha_4 c+\alpha_2 U^{{M}} +\alpha_3 \beta_2 U^{{M}}\\
%&+\alpha_3 \beta_2 U^{{N}} + U^Y <y),
%\mathbb{P}(Z<y)
%\end{aligned}
%\end{equation}
where $Z \sim {\cal N}(
\hat{\alpha}_0+\hat{\alpha}_1 x+\hat{\alpha}_2 (\hat{\gamma}_0+\hat{\gamma}_1 x'+\hat{\gamma}_2 c)+\hat{\alpha}_3 (\hat{\beta}_0+\hat{\beta}_1 x''+\hat{\beta}_2 (\hat{\gamma}_0+\hat{\gamma}_1 x'''+\hat{\gamma}_2 c)+\hat{\beta}_3 c)+\hat{\alpha}_4 c,
\{(\hat{\alpha}_2+\hat{\alpha}_3 \hat{\beta}_2 )^2 \hat{\sigma}_{{M}}^2+\hat{\alpha}_3^2\hat{\sigma}_{{N}}^2+\hat{\sigma}_Y^2\}^{1/2})$
and
$\{\hat{\alpha}_0,\hat{\alpha}_1,\hat{\alpha}_2,\hat{\alpha}_3,\hat{\beta}_0,\hat{\beta}_1,\hat{\beta}_2,\hat{\gamma}_0,\hat{\gamma}_1,\hat{\sigma}_{Y},\hat{\sigma}_{{M}},\hat{\sigma}_{{N}}\}$ are the estimated parameters of the three linear regressions, $Y\sim X+{M}+{N}$, ${N} \sim X+{M}$, and ${M}\sim X$.


{\bf Setting.}
We consider the following SCM:
\begin{equation}
\begin{gathered}
Y:=X+{M}+ {N}+ C+U^Y,
{N}:=X+ {M}+ C+U^{{N}},\\
{M}:=X+C+U^{{M}}, X:=C+U^X, C:=U^C,
\end{gathered}
\end{equation}
where $U^C\sim {\cal N}(0,1)$, $U^X\sim {\cal N}(0,1)$, $U^Y\sim {\cal N}(0,1)$, $U^{{M}} \sim {\cal N}(0,1)$, $U^{{N}} \sim {\cal N}(0,1)$, which are mutually independent normal distributions.
This SCM satisfies Assumptions \ref{SCAS}, \ref{ASM}, \ref{SUP1}, \ref{AS1}, and 4.3'.
We let $x'=0$, $x=1$, $y=0$, $c=0$, and ${\cal E}=\emptyset$.
We simulate 1000 times with the sample size $N=20$, $N=100$, and $N=10000$.





{\bf Results.}
We present the results of each estimator in Table \ref{tab:f}.
%The ground truth of $\text{\normalfont PNS}^{X \rightarrow Y}$ is $0.059$, with the following estimates:
%\begin{center}
%\textbf{$N=20$}:\, \, \, \, $0.057$ (95\%CI: $[0.003,0.153]$),\\\vspace{0.1cm}
%\textbf{$N=100$}:\, \, \,  $0.059$ (95\%CI: $[0.032,0.093]$),\\\vspace{0.1cm}
%\textbf{$N=10000$}: $0.059$ (95\%CI: $[0.056,0.062]$).
%\end{center}
%The ground truth of $\text{\normalfont PNS}^{X \rightarrow {N}  \rightarrow Y}$ is $0.097$, with the following estimates:
%\begin{center}
%\textbf{$N=20$}:\, \, \, \, $0.095$ (95\%CI: $[0.015,0.251]$),\\\vspace{0.1cm}
%\textbf{$N=100$}:\, \, \,  $0.097$ (95\%CI: $[0.059,0.144]$),\\\vspace{0.1cm}
%\textbf{$N=10000$}: $0.097$ (95\%CI: $[0.093,0.101]$).
%\end{center}
%The ground truth of $\text{\normalfont PNS}^{X \rightarrow {M} \rightarrow {N}  \rightarrow Y}$ is $0.135$, with the following estimates:
%\begin{center}
%\textbf{$N=20$}:\, \, \, \, $0.133$ (95\%CI: $[0.037,0.287]$),\\\vspace{0.1cm}
%\textbf{$N=100$}:\, \, \,  $0.134$ (95\%CI: $[0.096,0.180]$),\\\vspace{0.1cm}
%\textbf{$N=10000$}: $0.134$ (95\%CI: $[0.130,0.139]$).
%\end{center}
%The ground truth of $\text{\normalfont PNS}^{X \rightarrow {M}  \rightarrow Y}$ is $0.158$, with the following estimates:
%\begin{center}
%\textbf{$N=20$}:\, \, \,  \, $0.163$ (95\%CI: $[0.031,0.319]$),\\\vspace{0.1cm}
%\textbf{$N=100$}:\, \, \,  $0.160$ (95\%CI: $[0.108,0.215]$),\\\vspace{0.1cm}
%\textbf{$N=10000$}: $0.158$ (95\%CI: $[0.153,0.164]$).
%\end{center}
All means of the estimates are close to the ground truth. 
However, for small sample sizes, the estimators exhibit large 95 $\%$ CIs, indicating higher variability in estimation.
%All means of the estimators are close to the ground truth. 
%However, estimators for small sample sizes have large 95 $\%$ CIs.




We provide three additional experiments in Appendix \ref{appD1} under the following conditions: (1) no effect between ${M}$ and ${N}$, (2) no effect between $\{{M},{N}\}$ and $Y$, and (3) only effect through $X$$\rightarrow$${M}$$\rightarrow$${N}$$\rightarrow$$Y$.
In the setting (1), $\text{\normalfont PNS}^{X \rightarrow {M} \rightarrow {N}  \rightarrow Y}$ is equal to $0$.
In the setting (2), $\text{\normalfont PNS}^{X \rightarrow {N}  \rightarrow Y}$, $\text{\normalfont PNS}^{X \rightarrow {M} \rightarrow {N}  \rightarrow Y}$, and $\text{\normalfont PNS}^{X \rightarrow {M} \rightarrow Y}$ are all equal to $0$.
In the setting (3), $\text{\normalfont PNS}^{X \rightarrow Y}$, $\text{\normalfont PNS}^{X \rightarrow {N}  \rightarrow Y}$, and $\text{\normalfont PNS}^{X \rightarrow {M} \rightarrow Y}$ are all equal to $0$.
These results offer intuitive decompositions of T-PNS.


{We provide a sensitivity analysis on the monotonicity assumption in Appendix~\ref{appD2} by introducing a non-monotonic term in SCM, i.e., $Y:=X+M+ N+ C+\alpha U^Y+ (1-\alpha) (U^Y)^4$, where $\alpha \in [0,1]$ controls the degree of the violation of the monotonicity.
We observe that the magnitude of bias increases with greater violations of the monotonicity.
We additionally report experimental results using logistic regression for binary outcomes in Appendix \ref{appD3}.
The estimates obtained from logistic regression are reliable when the sample size is large.}
