\documentclass{uai2024} 


\usepackage{amsfonts} 
\usepackage{hyperref}
\usepackage{xcolor}
\newcommand{\jin}[1]{\textcolor{blue}{#1}}
\newcommand{\yuta}[1]{\textcolor{red}{#1}}

\begin{document}

Thank you for your constructive comments and suggestions, which will help us improve the paper. We will carefully incorporate them in the revised version. Below, each of your comments is stated first, followed by our response.

>Comment: 
Maybe there should be more explanations about the definition of PoC in continuous case. 
Why is it so defined, any intuition about it?
Any other methods to define PoC, how about define it as the follows? $PN(y;x_0,x_1)=P(Y_{x_0}<y|Y=y,X=x_1)$, $PS(y;x_0,x_1)=P(y \leq Y_{x_1}|Y=y,X=x_0)$. 
why the definition in paper should be chosen?

Our response:
Our definition focuses on the necessity and sufficiency of treatment $x_1$ relative to $x_0$ for producing the event $(Y\geq y)$, rather than the event $(Y=y)$. As discussed below Eq. (5), if we consider the binary outcome variable $O=\mathbb{I}(Y \geq y)$, our definition reduces to the standard definition of binary PoC over $X$ and $O$. Additionally, if $Y$ is binary with values $y_0 < y_1$, then our definition with $y=y_1$ reduces to the standard binary PoC. The alternative definitions you suggest, $PN(y;x_0,x_1)=P(Y_{x_0}<y|Y = y,X = x_1)$ and $PS(y;x_0,x_1)=P(y \leq Y_{x_1}|Y=y,X=x_0)$, do not naturally reduce to the standard binary PoC in these settings.
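
To sketch the first reduction, take the definition in the form $PN(y;x_0,x_1)=P(Y_{x_0}<y|Y\geq y,X=x_1)$ and $PS(y;x_0,x_1)=P(Y_{x_1}\geq y|Y<y,X=x_0)$. This is an informal restatement assumed for this response; the exact statement is the one given in the paper. Writing $O_x=\mathbb{I}(Y_x\geq y)$ for the potential outcome of $O$ (notation introduced only for this sketch), the events $(Y_{x_0}<y)$ and $(O_{x_0}=0)$ coincide, as do $(Y\geq y)$ and $(O=1)$, so that

$$PN(y;x_0,x_1)=P(Y_{x_0}<y|Y\geq y,X=x_1)=P(O_{x_0}=0|O=1,X=x_1),$$

$$PS(y;x_0,x_1)=P(Y_{x_1}\geq y|Y<y,X=x_0)=P(O_{x_1}=1|O=0,X=x_0).$$

The right-hand sides are the standard binary PN and PS for treatment $X$ and outcome $O$. For continuous $Y$, the event $(Y=y)$ is strictly finer than $(O=1)$, so the same rewriting in terms of $O$ alone is not available when conditioning on $Y=y$.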

>Comment: 
When treatment and outcome variables are vector variables, the PNS, PN and PS are defined with potential outcome $Y_{x_1}$ or $Y_{x_0}$ , in the form that all treatments are intervened. How about just intervening only a subset of all treatment variables? This scenario is also useful in real life.

Our response: 
Thank you for pointing out this new research direction.
We have not studied the case in which only a subset of the treatment variables is intervened on.
We plan to add the following sentences to the Conclusion.

``In this paper, we focus on the form of PoC in which all treatment variables are intervened on.
The scenario in which only a subset of the treatment variables is intervened on is also useful in practice.
We leave this scenario for future work."

>Comment: 
There may be some errors in the proof. 
Some core errors are listed below: \
(2a) In the proof of Lemma C.1, if Assumption 3.4 (Monotonicity over $f_Y$) is obeyed, here is one case where it is obeyed: 
there exists a set $U$ such that $0<P(U)<1$, and one $x_1 \in \Omega_X$, such that, the function $f_Y(x_1,U)$ is monotonically increasing on $U$, and one $x_0\in \Omega_X$, such that, the function $f_Y(x_0,U)$ is monotonically decreasing on $U$.
If $u_0<u_1$, $f_Y(x_1,u_0) <y_1<f_Y(x_1,u_1)$, and $f_Y(x_0,u_0)>y_2>f_Y(x_0,u_1)$.
There is no need to let $y_1=y_2$, but it is required in the paper.
In fact, it is easy to imagine one monotonically increasing function and another monotonically decreasing function with no overlap $y$ values. 
Then the implications for violations of Assumption 3.4 that are shown in the proof of Lemma C.1 will not exist.

Our response: 
Thank you for pointing this out. 
Lemma C.1 requires the following additional assumption, which we call Assumption A:
``Potential outcome $Y_x$ has a PDF $p_{Y_x}$ for each $x \in \Omega_X$, and its support $\\{y \in \Omega_Y: p_{Y_x}(y) \ne 0 \\}$ is $(-\infty,\infty)$ for each $x \in \Omega_X$."

This assumption is reasonable for continuous outcome variables.
For example, the multivariate linear regression model with Gaussian noise in [Hannart and Naveau 2018] satisfies it.
Assumption A guarantees that overlapping $y$ values exist: the ``no overlap" situation, $\\{f_Y(x_0,u):u \in \Omega_U\\}\cap \\{f_Y(x_1,u):u \in \Omega_U\\}=\emptyset$, would mean that the supports of $Y_{x_0}$ and $Y_{x_1}$ are disjoint, which contradicts Assumption A.
Therefore, under Assumption A, the argument in the proof of Lemma C.1 that starts from the negation of Assumption 3.4 goes through.
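
As a simple illustration of Assumption A, consider the scalar model $f_Y(x,u)=\beta x+u$ with $U\sim\mathcal{N}(0,\sigma^2)$ and $\sigma>0$. This is a special case chosen for this response, with $\beta$ and $\sigma$ as illustrative parameters; it is not the general model of [Hannart and Naveau 2018]. Then

$$Y_x=\beta x+U\sim\mathcal{N}(\beta x,\sigma^2),\qquad p_{Y_x}(y)=\frac{1}{\sqrt{2\pi}\,\sigma}\exp\left(-\frac{(y-\beta x)^2}{2\sigma^2}\right)>0 \quad \mbox{for all } y\in\mathbb{R},$$

so the support of $Y_x$ is the whole real line for every $x$. In particular, for any $x_0$ and $x_1$ the supports of $Y_{x_0}$ and $Y_{x_1}$ coincide, and the ``no overlap" situation cannot occur.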

We will also add the analogous assumption for the model with vector variables.

>Comment: 
(2b) In the proof of Lemma C.4, the author wants to derive the “Strict monotonicity over $f_Y$” from Rank Preservation Condition. 
But in the proof, only two values $u_0,u_1$ are considered and it proves that the direction of inequality does not change. 
But consider the scenario that $u_0<u_1<u_2$, but $f_Y(x,u_1)>f_Y(x,u_0)>f_Y(x,u_2)$ for all $x$, it is rank preserved, but not monotonic.

Our response: 
You are right! The rank preservation assumption does not imply monotonicity. We will remove all mention and discussion of the rank preservation assumption from the paper.
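
For concreteness, one instance of the scenario you describe is $f_Y(x,u)=x-(u-1)^2$ with $u_0=0.5$, $u_1=1$, $u_2=2$. This function is an illustrative choice made for this response and is not taken from the paper. It gives

$$f_Y(x,u_1)=x \;>\; f_Y(x,u_0)=x-0.25 \;>\; f_Y(x,u_2)=x-1 \quad \mbox{for all } x.$$

The ordering of the outcomes across these values of $u$ is the same for every $x$, yet $f_Y(x,\cdot)$ increases from $u_0$ to $u_1$ and decreases from $u_1$ to $u_2$, so it is not monotonic in $u$.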

>Comment: 
Insufficient literature review: I just find two recent papers on causal attribution with multiple treatments and outcomes, but they are not cited in this paper: \
(1) Lu Zitong, Geng Zhi, Li Wei, Zhu Shengyu and Jia Jinzhu (2023). Evaluating causes of effects by posterior effects of causes. Biometrika, 110(2): 449-465. \
(2) Li Wei, Lu Zitong, Jia Jinzhu, Xie Min, and Geng Zhi (2023). 
Retrospective causal inference with multiple effect variables. Biometrika. To appear.

Our response:
Thank you for bringing these papers to our attention.
We will cite them in the revised paper.
Note that our PoC with multiple outcome variables differs from these works because we deal with continuous variables.


\end{document}




%Comment 1: There exist some errors in the proof of the theorems, which may lead to incorrect theorems (forgive if I am wrong). Details are given in Q5.


%Our response: 
%We answer them later.




%The reason why we choose our definition of PN and PS is we focus on the necessity or sufficiency of treatment $x_1$ w.r.t. $x_0$ to produce the event $(Y\geq y)$, not $(Y=y)$.
%We also discussed your definitions of PN and PS, $P(Y_{x_0}<y|Y = y,X = x1)$ and $P(y \leq Y_{x_1}|Y=y,X=x_0)$, in Section 5.1. \jin{In what sense are they discussed in Section 5.1?}

%\yuta{We agree with our proof of Lemma C.1 in Appendix is incorrect.}\jin{Yuta, are you agreeing with the reviewer that there is an error?}



%\yuta{Weak Version: ``Potential outcome $Y_x$ has PDF $p_{Y_x}$ for each $x \in \Omega$, and the intersection of the supports $\cap_{x \in \Omega_X}\{y \in \Omega_Y: p_{Y_x}(y)\ne 0\}$ is an interval $(a,b)$ ($a < b$), which means the intersection of the supports is not empty or a point." }



%\jin{Can this assumption be relaxed, e.g. $P_U$ almost?}\yuta{Is it related to $x$? If you mean $P_X$ almost, we will fix $x_0$ or $x_1$ and we can not relax $P_X$ almost every $x$.} \jin{Requiring support to be $[-\infty,\infty]$ is strong.}
%The support of function $f$ means $\{x:f(x)\ne0\}$.


%\yuta{This assumption implies``the intersection of the open sets $(a_x,b_x)$ is not empty: $$\cap_{x \in \Omega_X} (a_x,b_x) \ne \emptyset,$$ where $(a_x,b_x)$ is the intersaction of the open sets $(c,d)$ such that $\mathbb{P}(c<Y_x<d)=1$ for each $x \in \Omega_X$." $(a_x,b_x)$ is the tightest open set which satisfies $\mathbb{P}(a_x<Y_x<b_x)=1$ for each $x$.}


 %\jin{how/why? further explanation?} 
%\yuta{Given $x_0,x_1$, the situation ``no overlap" means the following situations: (i) ``there exists $y$ such that $f_Y(x_0,u)<y<f_Y(x_1,u)$ for almost every $u$" and (ii) ``there exists $y$ such that $f_Y(x_1,u)<y<f_Y(x_0,u)$ for almost every $u$". Both (i) and (ii) mean 
%$P(f_Y(x_0,u)<y)=1$ and $P(f_Y(x_1,u)>y)=1$, and this means  $\{f_Y(x_0,u):u \in \Omega_U\}\cap \{f_Y(x_1,u):u \in \Omega_U\}=\emptyset$ (no overlap).}
%\yuta{(i) means the support of $Y_{x_0}$ is in $[-\infty,y)$ and the support of $Y_{x_0}$ is in $(y,-\infty]$ and (ii) means the support of $Y_{x_0}$ is in $(y,-\infty]$ and the support of $Y_{x_1}$ is in $[-\infty,y)$. Both violates Assumption A.}  


%\yuta{By adding the following assumption, ``no overlap $y$ values" does not happen. Hannart and Naveau's multivariate linear regression model with Gaussian noise satisfies this assumption.}



%We think \jin{What do you mean by "we think"? There should be a clear cut definition/answer. What "we think" doesn't matter. } 
%When there is ``no overlap $y$ values", the following situation happens (i) ``there exists $y$ such that $f_Y(x_0,u)<y<f_Y(x_1,u)$ for almost every $u$" and (ii) ``there exists $y$ such that $f_Y(x_1,u)<y<f_Y(x_0,u)$ for almost every $u$" given $x_0,x_1$.(i) implies $P(Y_{x_0}>Y_{x_1})=0$, and (ii) implies $P(Y_{x_0}>Y_{x_1})=0$.And, they also imply Assumption 3.3.

%\yuta{We will fix our proof of Lemma C.1 by adding the following sentence.}
%Theorems in our paper are correct if we add the above discussion. \jin{A theorem is either correct or not - it won't go from wrong to correct due to some discussion. Are you changing (the statement of) the theorem because there is an issue with the original theorem or are you just clarifying the proof (an error in proof?) but the original theorem is correct?}



%Thank you for pointing this out. \yuta{We will add the following rank preservation condition to Assumption 3.6: ``The rank of $Y_x$ preserves the rank of $U$." The situation $f_Y(x,u_1)>f_Y(x,u_0)>f_Y(x,u_2)$ and $u_0<u_1<u_2$ violates this condition. This condition is reasonable since it can be achieved just re-arranges the elements of $\Omega_U$ in the order of $Y_x$. If $f_Y(x,u_1)>f_Y(x,u_0)>f_Y(x,u_2)$ and $u_0<u_1<u_2$, by considering other total order $<'$ that satisfies $u_1<'u_0<'u_2$, the condition holds.}
%\jin{I don't find this "reasonable" claim makes sense. In fact, the assumption ``The rank of $Y_x$ preserves the rank of $U$" is essentially the monotonic assumption 3.5. }


%\yuta{Lemma C.4. is incorrect. We will delete the rank preservation assumptions (Assumption 3.6 and 4.5) from this paper since they have nothing to do with the monotonicity assumptions.}

%\yuta{ We will fix Theorem 3.1 as follows:\\ Under SCM ${\cal M}_S$ and Assumption A, Assumptions 3.3 and 3.4 are equivalent. Assumptions 3.5 is a strictly stronger requirement than 3.4.\\ We will fix Theorem 4.1 in the same way. We will delete Assumption 3.6 and 4.5 from Theorems 3.2, 4.2, 5.1, 5.2, 5.3 and Corollaries 5.1 and 5.2.}



%We think 
%$f_Y (x,u_1)>f_Y(x,u_0)>f_Y(x,u_2)$ ($u_0<u_1<u_2$) is also monotonic (decreasing) on $U$ from the definition of monotonic function. 
%\jin{This scenario is NOT monotonic! }
%We only consider two values $u_0,u_1$ since Assumption 3.5 requires only two arbitrary values. \jin{Are there errors in the proof or the theorem or not?}
%\yuta{Thus, our proof of Lemma C.4. is correct.}

