\documentclass{uai2024} 
\usepackage{amsfonts} 
\usepackage{hyperref}
\usepackage{xcolor}
\newcommand{\jin}[1]{\textcolor{blue}{#1}}
\newcommand{\yuta}[1]{\textcolor{red}{#1}}

\begin{document}



Thank you for your constructive comments and suggestions. They are  helpful for us to improve our paper. We will carefully incorporate them in the revised paper. In the following, your comments are first stated and then followed by our responses.

>Comment:
The abstract and beginning of section 3 emphasize that this paper “introduces a new concept, conditional average partial causal effect (CAPCE).” 
I find this claim to be an unnecessary overstatement.\
1a. In statistics, it is called the partial derivative of the variable importance curve. In econometrics, it is called the partial derivative of the nonparametric instrumental variable regression in the partially endogenous setting. There are more names.\
1b. A more accurate claim is that this paper studies CAPCE with unobserved confounding yet without separability, by introducing a new model.

Our response:
Thanks for the feedback. We will replace the sentence in the abstract "In this paper, we introduce conditional average partial causal effects (CAPCE)" with "In this paper, we study conditional average partial causal effects (CAPCE)." We will replace the first sentence in Section 3 "First, we introduce a new concept conditional average partial causal effect (CAPCE) to capture the heterogeneous causal effects of a continuous treatment" with "First, we formally define  conditional average partial causal effect (CAPCE) to capture the heterogeneous causal effects of a continuous treatment." We agree that the quantity represented by CAPCE has been implicitly studied in the literature. Still we think it's important to formally define and name this quantity.

>Comment:
The abstract and remark of section 3 emphasize that this paper’s identifying assumptions are “weaker than the assumption needed by existing work” for the nonparametric instrumental variable regression function with covariates. 
I am not sure the characterization of the previous separability assumptions is correct.\
2a. Figure 1 differs from what is typically meant by instrumental variables with baseline covariates. What is typically meant is that $W$ has no arrows into it, and $W$ has arrows out of it pointing to $Z$, $X$, and $Y$. 
Such a DAG is the partially endogenous setting in Newey and Powell (2003) and related work, which this paper holds up for comparison.\
2b. Figure 1 is something different. It may be a valid model to study, but the comparison does not seem right, and was a source of confusion for me. It seems this paper avoids separability but also changes the problem by changing the role of the covariates. Is the absent arrow from $W$ to $Z$ crucial to the identification argument? It is a stronger requirement on the instrument. On the other hand, allowing an arrow from $H$ to $W$ is a weaker requirement on the covariates. These differences should be discussed.

Our response: 

-Figure 1 is also a popular model for studying IV with covariates, e.g., in  [Huntington-Klein, 2020] and the following two papers:\
(1) Wu, A., et al. (2022). Instrumental variables in causal inference and machine learning: A survey.\
(2) Hartford, J., et al. (2017). Deep IV: A flexible approach for counterfactual prediction. In International Conference on Machine Learning. PMLR.\
(It seems Figure 1 is popular in the machine learning literature while economists typically allow the arrow from $W$ to $Z$ though they often don't use DAGs.) 

-We have discussed the IV model with an arrow from $W$ to $Z$ (Figure 3 in the Appendix) at the end of the Conclusion section with results given in Appendix A.2. CAPCE is still identifiable under Figure 3 (with the same assumption for identifying CAPCE under Figure 1). But estimating CAPCE under Figure 3 is more difficult as under Figure 3, we have to learn CAPCE=$E[\partial_x Y_{x}|w]$ as a function of $x$ for each $w \in \Omega_W$ respectively, while under Figure 1,  we can learn $E[\partial_x Y_x|w]$ directly as a function of $x$ and $w$.

-As you concerns, if there exists the arrow from $W$ to $Z$ (Figure 3 in Appendix) the conditions of Newey and Powell (2003) become $f_Y(X,W,H,u_Y)=f_Y^1(X,W,u_Y)+f_Y^2(H,u_Y)$ and
$E[f_Y^2(H,u_Y)|Z,W]=0$.
According to Theorem 3.1' in the Appendix, we can identify CAPCE under Fig 3 with the same assumptions of Theorem 3.1.
Thus, our identification assumptions are weaker than the assumptions needed by existing work, even in Figure 3.

>Comment: Within the context of the model of Figure 1, I do not see how the separability assumption stated as (2) implies the identifying expression stated immediately before the display.

Our response: This result is from [Newey and Powell, 2003]. In their notation, the separability conditions are $Y=f_Y^1(X,W)+e$ and
$E[e|Z,W]=0$. 

>Comment:
The title does not mention instrumental variables, which is where this paper’s contributions for CAPCE lie.

Our response: We plan to  change our title to
"Identification and Estimation of Conditional Average Partial Causal Effects via Instrumental Variable."



%Q5 Detailed Comments To The Authors:
%I will raise the score if these items are improved: the presentation of the covariate model; the discussion of separability; the framing of CAPCE being new versus this model for CAPCE being new.


%Our response: We discussed your questions about the covariate model in Section A.2 in the Appendix and our responses. Theorem 3.1' weakens the identification assumptions even under Fig 3. Fig 1 is not new, and our CAPCE identification assumptions are new.

\end{document}



%We will delete the word ``new". We name it conditional average partial causal effect (CAPCE) after the average partial causal effect (APCE) by Kawakami et al. (2023). Our paper is the first paper to identify CAPCE directly, not via $E[Y_x]$.

%Our response: 
%Fig 1 is not our new model, and a popular model in recent nonparametric IV literature used in Huntington-Klein (2020), (1) and (2).

%Letting $u_Y=\emptyset$ and $\epsilon=f_Y^2(H,u_Y)$, Separability conditions
%, $Y=f_Y(X,W,H,u_Y)=f_Y^1(X,W,u_Y)+f_Y^2(H,u_Y)$ and $E[f_Y^2(H,u_Y)|Z,W]=0$,
%can be rewritten as $Y=f_Y^1(X,W)+\epsilon$ and $E[\epsilon|Z,W]=0$. This coincides with the notations of Newey and Powell (2003).



%Comment 3:
%2a. Consider Figure 1 and display (1). This DAG differs from what is typically meant by instrumental variables with baseline covariates. What is typically meant is that $W$ has no arrows into it, and $W$ has arrows out of it pointing to $Z$, $X$, and $Y$. Such a DAG is the partially endogenous setting in Newey and Powell (2003) and related work, which this paper holds up for comparison.

%Our response: As stated in the Conclusion, we give the discussion and alternative identification for this situation in the Appendix.
%\jin{Yuta, In the IV with covariates literature, is Figure 1 or Figure 3 more commonly used? or which papers assumed Figure 1 and which papers assumed Figure 3?}

%\yuta{The papers such as Huntington-Klein (2020),\\(1) Wu, A., Kuang, K., Xiong, R., \& Wu, F. (2022). Instrumental variables in causal inference and machine learning: A survey. arXiv preprint arXiv:2212.05778. (Survey Paper)\\ and\\(2) Hartford, J., Lewis, G., Leyton-Brown, K., \& Taddy, M. (2017, July). Deep IV: A flexible approach for counterfactual prediction. In International Conference on Machine Learning (pp. 1414-1423). PMLR. (DeepIV)\\ also use DAG the same as Fig 1. Fig 1 is popular in causal inference or machine learning areas.}

%\yuta{Newey and Powell (2003) do not use DAG and just state the separability assumption. But, they allow the arrow $W$ to $Z$ as Fig 3. In economics, researchers do not use DAG but allow Fig 3.}


%Comment 4:
%2b. Clearly Figure 1 is something different. It may be a valid model to study, but the comparison does not seem right, and was a source of confusion for me. It seems this paper avoids separability but also changes the problem by changing the role of the covariates. Is the absent arrow from $W$ to $Z$ crucial to the identification argument? It is a stronger requirement on the instrument. On the other hand, allowing an arrow from $H$ to $W$ is a weaker requirement on the covariate. These differences should be discussed.


%Our response:
%We discussed it in Appendix A.2, and Theorem 3.1' shows that CAPCE is identifiable under the same assumptions even when the arrow from $W$ to $Z$ exists. We focused on Fig 1 since it is a popular and reasonable IV model with covariates. Our view of the assumptions is the opposite. The absent arrow from $W$ to $Z$ seems weak since both $W$ and $Z$ are observed; on the other hand, the absent arrow from $H$ to $W$ seems to be a strong assumption since $H$ is a hidden variable. Estimating CAPCE based on Theorem 3.1 in Fig 1 has the merit that we can learn $g(x,w)$ directly as a function of $X$ and $W$.