\label{sec-exp}
In this section, we perform experiments to illustrate the finite-sample performance of the estimators of the moments of causal effects.


%\yuta{{I have removed {Heckman1997}.}}


{\bf Estimation.}
%We generate $\{y^1_{k^1}\}_{k^1=1}^{N_1}, \{y^2_{k^2}\}_{k^2=1}^{N_2}, \dots, \{y^m_{k^m}\}_{k^m=1}^{N_m}$  by i.i.d. sampling from a uniform distribution $U[\text{min}(Y),\text{max}(Y)]$ for Monte Carlo integration.
%We use the i.i.d. sampled dataset ${\cal D}=\{X_i,Y_i\}_{i=1}^{N}$.
%, where $N=50$.
%from the distribution $\mathbb{P}(X,Y)$, which is induced by SCM ${\cal M}$.
%We calculate the emprical CDFs and expectations \citep{Vaart1998} by
%\begin{gather}
%\hat{\mathbb{P}}(Y<y|X=x)=\frac{\sum_{i=1}^N\mathbb{I}(Y_i<y,X_i=x)}{\sum_{i=1}^N\mathbb{I}(X_i=x)},\\
%\hat{\mathbb{E}}[Y|X=x]=\frac{\sum_{i=1}^NY_i\mathbb{I}(X_i=x)}{\sum_{i=1}^N\mathbb{I}(X_i=x)}
%\end{gather}
%for any $x \in \Omega_X$ and $y \in \Omega_Y$.
The family of moments of causal effects $\sigma^{(m)}, \sigma_L^{(m)}, \sigma_U^{(m)}, \sigma(i,j;k,h), \sigma_L(i,j;k,h), \sigma_U(i,j;k,h)$ in Theorems~\ref{theo1}-\ref{theo4} (and the central moments of causal effects $\bar{\sigma}^{(m)},\bar{\sigma}_L^{(m)}, \bar{\sigma}_U^{(m)}, \bar{\sigma}(i,j,k,h), \bar{\sigma}_U(i,j,k,h), \bar{\sigma}_L(i,j,k,h)$ in Appendices \ref{appB} and \ref{appC}) are estimable by plugging in the empirical CDFs and expectations \citep{Vaart1998} and calculating the integrals using the Monte Carlo integration method \citep{Press2007}.
Let $N$ be the sample size of the dataset, and let $N_1$, $N_2$, $N_3$, and $N_4$ be the numbers of points for Monte Carlo integration on $y_1$, $y_2$, $y_3$, and $y_4$.
We assume that
%\begin{assumption}[Boundness of $\Omega_Y$]
%\label{BOUD}
the domains of $Y$ and $Y-\mathbb{E}[Y|X=x]$ for any $x \in \Omega_X$ are bounded by $[a,b]$, 
which is required for Monte Carlo integration. 
The details of all estimators are shown in Appendix \ref{appCon}.
They are all consistent estimators as discussed in Appendix \ref{appCon}.



%{\bf Baseline for continuous outcome.}
%We compare our empirical CDF-based estimators %for a continuous outcome 
%with the estimators proposed by \citet{Heckman1997}.
%\citet{Heckman1997} proposed an estimation method for ICE in the case of a continuous outcome under the rank invariance assumption and applied it to estimating the variance of causal effects.
%Their method is applicable for estimating the moments of causal effects.


{\bf Simulation for the moments of causal effects.}
%Next, we perform experiments to illustrate finite-sample properties of the estimator.
We assume the following SCM (A):
\begin{gather}
Y:=-(X+1)U\mathbb{I}(XU\geq 0),\\
X \sim \text{Bern}({0.8}), U\sim \text{Unif}(-1,1),
\end{gather}
where $\text{Bern}(p)$ is a Bernoulli distribution with probability $p$, and $\text{Unif}(-1,1)$ is a uniform distribution over $[-1,1]$. 
This setting satisfies Assumptions \ref{ASEXO2} and \ref{MONO2}.
%This setting does not satisfy the rank invariance assumption.
%The domain of $Y$ is bounded within $[-2,0]$.
%The central moments of the causal effects $\overline{\mu}^{(m)}$ are equal to $\mathbb{E}[(-U+\mathbb{E}[U])^m]$ for $m=1,\dots$.
We simulate 1000 times with the sample size $N=20,100,1000$, respectively.
We let $N_1$, $N_2$, $N_3$, and $N_4$ all be 1000.


%\yuta{[I set the sample sizes for $X=0$ and $X=1$ to be unbalanced.]}
%\jin{Why do you insist on usng a SCM that satisfies the rank invariance assumption? Why insist on setting $U$ uniform?} \yuta{[If $U$ is a normal distribution, SCM does not violate the rank invariance assumption.  The rank invariance assumption is violated if $U$ is discrete.]}
%\jin{Why don't you select a SCM that violate the the rank invariance assumption? I believe monotoncity and rank invariance are different conditions even for continuous case.. }
%\jin{Why is the rank invariance assumption  violated if $U$ is discrete? $Y_1 =-2U, Y_0=-U$. It looks to me $-2U < -2a \equiv -U < -a$, therefore the assumption is satisfied. }





\begin{table*}[tb]
\centering
\caption{Results of numerical experiments for SCM (A). 
{We present the estimates of the second moments $\sigma^{(2)}$, third moments $\sigma^{(3)}$, and fourth moments $\sigma^{(4)}$ of causal effects along with their respective upper and lower bounds.
Additionally, we report the means of each estimator accompanied by their 95\% confidence intervals.}}
\label{tab:a2}
%\vspace{-0.25cm}
\scalebox{1}{
\begin{tabular}{c|cccc}
\hline
Estimators & $N=20$ & $N=100$ & $N=1000$ &  Ground Truth \\
\hline
\hline
$\sigma^{(2)}$  & $0.405 ([0.138,0.841])$ &  $0.373 ([0.215,0.659])$ & $0.335([0.289,0.418])$ &$0.333$ \\
$\sigma_U^{(2)}$  & $1.548 ([0.804,2.62])$ &$1.582 ([1.127,2.030])$ &  $1.647 ([1.485,1.769])$ &- \\
$\sigma_L^{(2)}$   & $0.108 ([0.000,0.679])$ &$0.005 ([0.000,0.018])$ &  $0.000 ([0.000,0.000])$ &- \\
%& $100$ & $0.373$ & $[0.215,0.659]$ &$0.333$ \\
%& $1000$ & $0.335$ & $[0.289,0.418]$ &$0.333$ \\
%$\sigma^{(2)}$ (\citep{Heckman1997})  &  $1.160 ([0.298,4.510])$ &$0.427 ([0.234,0.613])$ & $0.345 ([0.295,0.423])$  &$0.333$ \\
%& $100$ & $0.427$ & $[0.234,0.613]$ &$0.333$ \\
%& $1000$ & $0.345$ & $[0.295,0.423]$ &$0.333$ \\
\hline
$\sigma^{(3)}$   &  $-0.572 ([-1.679,0.093])$ & $-0.293 ([-0.750,-0.065])$ & $-0.245 ([-0.305,-0.186])$ &$-0.250$ \\
$\sigma_U^{(3)}$   & $0.087 ([-0.107,0.292])$ &$0.120 ([0.037,0.234])$ &  $0.126 ([0.074,0.177])$ &- \\
$\sigma_L^{(3)}$   & $-2.479 ([-5.381,-0.612])$ &$-3.175 ([-3.999,-1.978])$ &  $-3.412 ([-3.877,-3.113])$ &- \\
%& $100$ & $-0.293$ & $[-0.750,-0.065]$ &$-0.250$ \\
%& $1000$ & $-0.245$ & $[-0.305,-0.186]$ &$-0.250$ \\
%$\sigma^{(3)}$ (\citep{Heckman1997})  &  $-1.998 ([-10.714,-0.142])$ &$-0.332 ([-0.488,-0.232])$ & $-0.273 ([-0.345,-0.218])$&$-0.250$ \\
%& $100$ & $-0.332$ & $[-0.488,-0.232]$ &$-0.250$ \\
%& $1000$ & $-0.273$ & $[-0.345,-0.218]$ &$-0.250$ \\
\hline
$\sigma^{(4)}$   &  $0.963 ([0.066,6.922])$ & $0.194 ([0.061,0.384])$& $0.205 ([0.112,0.283])$ &  $0.200$ \\
$\sigma_U^{(4)}$  & $6.837 ([1.157,11.497])$ &$7.712 ([4.909,10.848])$ &  $8.093 ([7.214,8.878])$ &- \\
$\sigma_L^{(4)}$   & $0.008 ([0.000,0.085])$ &$0.000 ([0.000,0.000])$ &  $0.000 ([0.000,0.000])$ &- \\
%& $100$ & $0.194$ & $[0.061,0.384]$ &$0.200$ \\
%& $1000$ & $0.205$ & $[0.112,0.283]$ &$0.200$ \\
%$\sigma^{(4)}$ (\citep{Heckman1997})  &  $4.200 ([0.226,24.484])$ & $0.229 ([0.067,0.902])$ & $0.221 ([0.183,0.270])$  &$0.200$ \\
%& $100$ & $0.229$ & $[0.067,0.902]$ &$0.200$ \\
%& $1000$ & $0.221$ & $[0.183,0.270]$ &$0.200$ \\
\hline
\end{tabular}
}
\end{table*}



\begin{table*}[tb]
\centering
\caption{Results of numerical experiments for SCM (B).
{We present the estimates of the product moments of causal effects $\sigma(1,0;0,-1)$ along with their respective upper and lower bounds.
Additionally, we report the means of each estimator accompanied by their 95\% confidence intervals.}}
\label{tab:a}
%\vspace{-0.25cm}
\scalebox{1}{
\begin{tabular}{c|cccc}
\hline
Estimators & $N=20$ & $N=100$ & $N=1000$ &  Ground Truth \\
\hline
\hline
$\sigma(1,0;0,-1)$  & $-0.300 ([-0.437,-0.131])$ & $-0.323 ([-0.420,-0.239])$ & $-0.327 ([-0.419,-0.260])$ &$-0.333$ \\
%& $100$ & $-0.323$ & $[-0.420,-0.239]$ &$-0.333$ \\
%& $1000$ & $-0.327$ & $[-0.419,-0.260]$ &$-0.333$ \\
$\sigma_U(1,0;0,-1)$   & $-0.154 ([-0.521,0.000])$ &$-0.105 ([-0.217,-0.029])$ &  $-0.168 ([-0.222,-0.112])$ &- \\
$\sigma_L(1,0;0,-1)$   & $-0.352 ([-0.559,-0.100])$ &$-0.390 ([-0.583,-0.278])$  &  $-0.338 ([-0.409,-0.260])$ &- \\
\hline
\end{tabular}
}
\end{table*}


{\bf Results.}
We present the estimates obtained using our proposed methods
%and obtained using the method of \citep{Heckman1997} 
in Table \ref{tab:a2}.
All means of the estimators are close to the ground truth for $N=1000$. 
All ground truth values lie within the computed bounds. 
However, estimators for small sample sizes have large 95 $\%$ CIs, %and they show slow convergence to the ground truth from the point of view of the 95 \% CIs, 
especially for high-order moments.
%Our estimator is more efficient than the estimators proposed by \citep{Heckman1997}, particularly when $N=20$ and $N=100$.
%We present the estimated bounds on the moments of causal effects in Table \ref{tab:a}.

%{\bf Results (Ours).}
%We present the estimates obtained using our proposed method.
%The ground truth of the second moment of $Y_1-Y_0$ is $0.333$, and the estimates of the second moment are
%\begin{center}
%\textbf{$N=20$}:\, \, \,  $0.405$ (95\%CI: $[0.138,0.841]$),\\\vspace{0.1cm}
%\textbf{$N=100$}:\, \,  $0.373$ (95\%CI: $[0.215,0.659]$),\\\vspace{0.1cm}
%\textbf{$N=1000$}:  $0.335$ (95\%CI: $[0.289,0.418]$).
%\end{center}
%The ground truth of the third moment of $Y_1-Y_0$ is $-0.250$, and the estimates of the third moment are
%\begin{center}
%\textbf{$N=20$}:\, \, \, \, \, $-0.572$ (95\%CI: $[-1.679,0.093]$),\\\vspace{0.1cm}
%\textbf{$N=100$}:\, \, $-0.293$ (95\%CI: $[-0.750,-0.065]$),\\\vspace{0.1cm}
%\textbf{$N=1000$}: $-0.245$ (95\%CI: $[-0.305,-0.186]$).
%\end{center}
%The ground truth of the fourth moment of $Y_1-Y_0$ is $0.200$, and the estimates of the fourth moment are
%\begin{center}
%\textbf{$N=20$}:\, \, \, $0.963$ (95\%CI: $[0.066,6.922]$),\\\vspace{0.1cm}
%\textbf{$N=100$}:\, \,  $0.194$ (95\%CI: $[0.061,0.384]$),\\\vspace{0.1cm}
%\textbf{$N=1000$}: $0.205$ (95\%CI: $[0.112,0.283]$).
%\end{center}
%All means of the estimators are close to the ground truth. 
%However, estimators for small sample sizes have large 95 $\%$ CIs, and they show slow convergence to the ground truth from the point of view of the 95 \% CIs, especially for high-order moments.


%We present the estimated bounds on the moments of causal effects.
%We estimate bounds of the moment when $N=1000$.
%\begin{center}
%\textbf{Upper bound of the second moment}:\\
%$1.647$ (95\%CI: $[1.485,1.769]$),\\\vspace{0.1cm}
%\textbf{Lower bound of the second moment}:\\
%$0.000$ (95\%CI: $[0.000,0.000]$),\\\vspace{0.1cm}
%\textbf{Upper bound of the third moment}:\\
%$0.126$ (95\%CI: $[0.074,0.177]$),\\\vspace{0.1cm}
%\textbf{Lower bound of the third moment}:\\
%$-3.412$ (95\%CI: $[-3.877,-3.113]$),\\\vspace{0.1cm}
%\textbf{Upper bound of the fourth moment}:\\
%$8.093$ (95\%CI: $[7.214,8.878]$),\\\vspace{0.1cm}
%\textbf{Lower bound of the fourth moment}:\\
%$0.000$ (95\%CI: $[0.000,0.000]$).
%\end{center}
%All ground truth values lie within the computed bounds.


%{\bf Results (\citep{Heckman1997}).}
%We present the estimates obtained using the method of \citep{Heckman1997}.
%The ground truth of the variance of $Y_1-Y_0$ is $0.083$, and 
%The estimates of the second moment are
%\begin{center}
%\textbf{$N=20$}:\, \, \, $1.160$ (95\%CI: $[0.298,4.510]$),\\\vspace{0.1cm}
%\textbf{$N=100$}:\, \,  $0.427$ (95\%CI: $[0.234,0.613]$),\\\vspace{0.1cm}
%\textbf{$N=1000$}: $0.345$ (95\%CI: $[0.295,0.423]$).
%\end{center}
%The ground truth of the skewness of $Y_1-Y_0$ is $0$, and 
%The estimates of the third moment are
%\begin{center}
%\textbf{$N=20$}:\, \,  $-1.998$ (95\%CI: $[-10.714,-0.142]$),\\\vspace{0.1cm}
%\textbf{$N=100$}:\, \, $-0.332$ (95\%CI: $[-0.488,-0.232]$),\\\vspace{0.1cm}
%\textbf{$N=1000$}: $-0.273$ (95\%CI: $[-0.345,-0.218]$).
%\end{center}
%The ground truth of the kurtosis of $Y_1-Y_0$ is $1.8$, and 
%The estimates of the fourth moment are
%\begin{center}
%\textbf{$N=20$}:\, \, $4.200$ (95\%CI: $[0.226,24.484]$),\\\vspace{0.1cm}
%\textbf{$N=100$}:\, \,  $0.229$ (95\%CI: $[0.067,0.902]$),\\\vspace{0.1cm}
%\textbf{$N=1000$}: $0.221$ (95\%CI: $[0.183,0.270]$).
%\end{center}






{\bf Simulation for the product moments of causal effects.}
We assume the following SCM (B):
\begin{equation}
Y:=X^2U, U\sim \text{Unif}(0,1),
\end{equation}
where $X$ takes values in $\{-1, 0, 1\}$ with the probabilities $\mathbb{P}(X=-1)=\mathbb{P}(X=0)=\mathbb{P}(X=1)=1/3$.  
The domain of $Y$ is bounded within $[0,1]$.
%The covariance of the causal effect $\overline{\rho}_{i,j;k,h}$ is equal to $\mathbb{E}[-(U-\mathbb{E}[U])^2]$.
%The variances of the causal effects $\mathbb{E}[\{(Y_1-Y_0)-(\mathbb{E}[Y_1]-\mathbb{E}[Y_0])\}^2]$ and $\mathbb{E}[\{(Y_0-Y_{-1})-(\mathbb{E}[Y_0]-\mathbb{E}[Y_{-1}])\}^2]$ are equal to $\mathbb{E}[(-U+\mathbb{E}[U])^2]$.
This setting satisfies Assumptions \ref{ASEXO2} and \ref{MONO2}.
We simulate 1000 times with the sample size $N=20,100,1000$, respectively.
We let $N_1$ and $N_2$ both be 1000.
%\citet{Heckman1997} did not study the product moments of causal effects.


{\bf Results.}
We present the estimates for $\mathbb{E}[(Y_1-Y_0)(Y_0-Y_{-1})]$ %obtained using our proposed method 
in Table \ref{tab:a}.
All means of the estimators are close to the ground truth. 
The ground truth value lies within the computed bounds.
However, estimators for small sample sizes have large 95 $\%$ CIs.
%We present the estimated bounds on the moments of causal effects. \yuta{We estimate bounds of the moment when $N=20, 100, 1000$.}

Overall, the results show that, as the sample size increases, the estimates are close to the ground truths.
{Figures~\ref{fig:ap1} and~\ref{fig:ap2} in Appendix \ref{appE1} present the plots of the estimates obtained from the numerical experiments.}



%The ground truth of the covariance of $Y_1-Y_0$ and $Y_0-Y_{-1}$ is $-0.333$, and the estimates of the product moment are
%\begin{center}
%\textbf{$N=20$}:\, \, \,  $-0.300$ (95\%CI: $[-0.437,-0.131]$),\\\vspace{0.1cm}
%\textbf{$N=100$}:\, \,   $-0.323$ (95\%CI: $[-0.420,-0.239]$),\\\vspace{0.1cm}
%\textbf{$N=1000$}: $-0.327$ (95\%CI: $[-0.419,-0.260]$).
%\end{center}
%All means of the estimators are close to the ground truth. 
%However, estimators for small sample sizes have large 95 $\%$ CIs.
%We present the estimated bounds on the moments of causal effects.
%We estimate bounds of the moment when $N=1000$.
%\begin{center}
%\textbf{Upper bound of the product moment}:\\
%$-0.168$ (95\%CI: $[-0.222,-0.112]$),\\\vspace{0.1cm}
%\textbf{Lower bound of the product moment}:\\
%$-0.338$ (95\%CI: $[-0.409,-0.260]$).
%\end{center}
%The ground truth value lies within the computed bounds.






%We present additional experiments for a discrete outcome in Appendix \ref{appF2}.


