\documentclass{uai2024} 

\usepackage{amsfonts} 
\usepackage{hyperref}
\usepackage{xcolor}
\newcommand{\jin}[1]{\textcolor{blue}{#1}}
\newcommand{\yuta}[1]{\textcolor{red}{#1}}

\begin{document}


Thank you for your positive review!
In the following, your comments are first stated and then followed by our responses.

>Comment:
The identification equation (3) is derived under the structural equation model where the IV $Z$ is independent of the conditioning variables $W$. 
The case that handles the dependence of $Z$ on $W$ gives identification equation (42), which appears to be quite challenging to solve when $W$ contains multiple continuous variables. 
This was not investigated in the experiments.

Our response:
We will perform the experiments about estimating CAPCE based on Eq. (42) and add the results in the appendix of the revised version.  

>Comment: In the second column on page 3, close to the bottom, the constant $\kappa$ is undefined, and Equation (5) appears to have missing parts.

Our response: 
The constant $\kappa$ can take value that satisfies $\kappa>(1+d)/2$ where $d$ is the dimension of the vector $W$ (specified after Eq. (1)). 
In the experiments, we let $\kappa$ be $2$ and $l$ be $1$ in Eq. (5). 
This compactness restriction is also used in [Newey and Powell, 2003]. We don't find Eq. (5) has missing parts. 

>Comment: In the second column on page 4, about halfway, the symbols for test datasets are inconsistent, such as ${\cal D'}^{(1)}$ and $D^{(1)'}$.

Our response:
We will fix ${\cal D'}^{(1)}$ and ${\cal D'}^{(2)}$ to ${\cal D}^{(1)'}$ and ${\cal D}^{(2)'}$.

>Comment: How does the choice of the reference level $z_0$ impact the estimator, if at all?

Our response:
The choice of the reference point $z_0$ does not affect the consistency results or rate of convergence, but it may affect the variance of the estimator. In our experiments, we take the minimum value of $Z$ as  a standard reference point $z_0$. The choice of the  reference point $z_0$ did not affect the standard deviation of the estimators much in our experiments.

>Comment: How was the mean squared error defined for the simulation studies?

Our response:
MSE is computed as  $\frac{1}{N_1'}\sum_{i=1}^{N_1'}(\hat{g}(x_i^{(1)'},w_i^{(1)'})-g(x_i^{(1)'},w_i^{(1)'}))^2$ with test dataset ${\cal D}^{(1)'}$.

>Comment: The last few sentences of the real data analysis claim that there is a significant difference of APCE between individuals of different IQ levels. But the bootstrap results in Appendix G.5 suggest otherwise, except maybe the coefficient of $W^2X$ from PTSLS.

Our response:
Although the coefficients in Tables 11 and 12 appear to be small, when plugging in the IQ values, e.g. $W=80, W=120$, the differences in APCE values are significant.
For instance, Table 13 shows that for students of 8 years of education, APCE is $31.750$ for IQ 80  and  is $71.523$ for IQ 120, a significant difference.








%We remove the word ``significant" or ``significantly". There is a difference in APCE between individuals of different IQ levels according to the bootstrap means of estimates. \yuta{Figure 6 and Table 13 show that there is a difference APCE between individuals of different IQ levels. For example, APCE for students of 8 years of education and 60 IQ is $17.838$ and that for students of 8 years of education and 140 IQ is $97.384$. There are large differences in effects.}





\end{document}


%Thank you for your constructive comments and suggestions. They are very helpful for us to improve our paper. We will carefully incorporate them in the revised paper. 

%\yuta{The constant $\kappa$ is a arbitrary value that satisfies $\kappa>(1+d)/2$. This depends on how restricted the functional space is. A larger value of $\kappa$ restricts function more at the point the norm $\|(x,w^T)\|$ is large, and the estimated CAPCE becomes smaller at the point the norm $\|(x,w^T)\|$ is large.
%$d$ is the dimension of covariates.

%We plan to change ``Let $N'=N_1'+N_2'$" in the second column on page 4 to ``Let $N'=N_1'+N_2'$ and  $(z_1',\ldots,z_{N'}')=(z_1^{(1)'},\ldots,z_{N_1'}^{(1)'},z_1^{(2)'},\ldots,z_{N_2'}^{(2)'})$."
%\jin{Yuta, what are you  talking about? That is not what the reviewer asked.}

%\yuta{[I moved our previous response here. I have checked ``a reference point $z_0$ did not affect the SD", but I did not write the results in the paper.]}
