

{\it Probabilities of causation} (PoC) are a family of probabilities quantifying whether one event was the real cause of another in a given scenario \citep{Robins1989,Pearl1999,Tian2000,Pearl09,Kuroki2011,Dawid2014,Dawid2016,Dawid2017,Murtas2017,Hannart2018,Shingaki2021,Kawakami2023b}.
PoC are valuable quantities for decision-making \citep{Li2019,Li2022b} and  for explainable artificial intelligence (XAI) that aims to reduce the opaqueness of AI-based decision-making systems \citep{Galhotra2021,Watson2021}.
%PoC is important not only in artificial intelligence but also in meteorology \citep{Hannart2018} and in law \citep{Dawid2017b}.
\citet{Pearl1999} introduced three types of PoC over binary events, namely 
%There are three types of PoC regarding treatment ($X$) and outcome ($Y$), named 
the probability of necessity and sufficiency (PNS), the probability of necessity (PN), and the probability of sufficiency (PS). 
They are defined based on the joint probability distribution of two potential outcomes. 
\citet{Tian2000} provided the bounds of PNS, PN, and PS in terms of observational and experimental data and showed that PNS, PN, and PS are  identifiable under the assumptions of exogeneity and monotonicity.
The problem of bounding PoC was further extended in \citep{Li2019,Li2022b,Li2023,MuellerLiPearl}. However, all these works are restricted to binary treatment and outcome. 
More recently, \citet{Li2022,Li2022c} extended the problem of bounding PoC to multi-valued discrete treatment and outcome and provided  bounds for various variants of PoC.


In this paper, we aim to extend the concept of PoC to continuous treatment and outcome. 
There is considerable interest in continuous treatment and outcome  
%Recently, there has been a growing interest in a continuous treatment beyond a binary treatment 
in causal inference \citep{Imbens2004,Kennedy2017,Bahadori2022}, e.g., dose-response studies \citep{Wong1996,Emilien2000,Ivanova2008} and policy evaluations with continuous actions \citep{Kallus2018,Krishnamurthy2019,Majzoubi2020}.
%\citep{Tian2000} showed bounds of PNS, PN, and PS and gave the identification theorem of PNS, PN, and PS under assumptions of exogeneity and monotonicity. \citep{Li2022} extended their results to a multi-valued discrete treatment.
For instance, doctors want to know the dose-response relationship between the amount of insulin and the blood sugar level. 


We provide a nonparametric identification theorem for each type of PoC we introduced. The identification of binary PoC relies on a monotonicity assumption \citep{Tian2000}. 
We generalize the monotonicity assumption over binary treatment and outcome to continuous settings.
We discuss the relationship of our proposed monotonicity assumption with another commonly used assumption in the causal inference literature - monotonicity over structural functions \citep{Heckman1999,Vytlacil2002,Heckman2005,Chernozhukov2005,Chernozhukov2007,Imbens2009}.
%, and rank preservation assumption \citep{Robins1989,Robins1991,Have2007,Vansteelandt2014,Bothmann2023,Hernan2023}. 

%\yuta{[Comment: I have deleted all rank preservation assumptions.]}


We further extend the concept of PoC to capture causal effects between multiple treatments and multiple outcomes, which are drawing growing interests \citep{Kang1990,Zhang1998,Sammel1999,Segal2011,Lee2012,Kennedy2019b,Rimal2019}.
For instance, \citet{Hannart2018} investigated causal links between anthropogenic forcings, e.g., greenhouse gases (carbon dioxide, methane, nitrous oxide, halocarbons) emission and deforestation, and the observed climate changes, e.g., spatial–temporal vector of  Earth surface temperature. 
They used a multivariate linear regression model with Gaussian noise to evaluate PoC. 

%In addition, \citep{Hannart2018} considered continuous and multiple treatments and outcomes under the parametric Gaussian setting. We discuss the identification assumptions of PoC applicable to both discrete and continuous cases in this paper.

We also introduce more complicated variants of PoC and provide identification theorems for them.
They include PoC for a sub-population with specific covariates information considered by \citep{Li2022b} and PoC with multi-hypothetical terms  studied by \citet{Li2022} for discrete treatment and outcome. These variants of PoC capture more sophisticated counterfactual information useful for decision-making. 


Finally, we show an application of our results to a real-world dataset on education.

\begin{comment}
To begin with, we discuss nonparametric identification problems of PoC with a single discrete or continuous treatment ($X$), a single outcome ($Y$), and a single latent exogenous factor ($U$) using \emph{a scalar structural causal model}, i.e., $Y:=f_Y(X,U)$.
Three types of PoC for a non-binary treatment and outcome are easily defined using the cumulative distributional function (CDF) following the definition on \citep{Tian2000}.
%Three types of PoC are identifiable by some assumptions.
Historically, there are four widely used assumptions in causal inference.
We note that IV literature often discusses these assumptions on the joint probability distribution of potential outcomes in the relationship between an instrumental variable (IV) ($Z$) and a treatment variable ($X$), i.e., $\mathbb{P}(X_{z_0},X_{z_1})$.


First, \emph{monotonicity on potential outcomes} is the monotonicity of the relationship between two potential outcomes and is used in the computer science field \citep{Balke1997,Tian2000,Jung2021} and in economics \citep{Imbens1994,Angrist1996,Angrist2009}.
%in the studies about randomized controlled experiment with non-compliance.
Second, \emph{monotonicity on SCM} is the monotonicity for the structural function $f_Y$ on $U$. 
This assumption has appeared in the IV literature in economics, especially the latent index model related to the marginal treatment effect (MTE) \citep{Heckman1999,Vytlacil2002,Heckman2005}.
Third, \emph{strict monotonicity on SCM} is the strict monotonicity of the structural function $f_Y$ on $U$. 
This assumption has appeared in the IV literature in economics, especially non-separable IV models for continuous treatment \citep{Chesher2003,Chernozhukov2005,Chernozhuk
ov2007,Imbens2009}.
Finally, \emph{rank preservation assumption} is the assumption that rank on one potential outcome $Y_x$ is preserved in the other potential outcome $Y_{x'}$, and was often used in medicine, epidemiology, and statistics \citep{Robins1989,Robins1991,Have2007,Vansteelandt2014,Bothmann2023,Hernan2023}.
We will explain the relationship of these four assumptions and give an identification theorem of PoC using conditional CDF based on these assumptions.

Recently, there also has been growing interest in multiple outcomes \citep{Kang1990,Zhang1998,Sammel1999,Segal2011,Lee2012,Kennedy2019b,Rimal2019}.
We deal with multivariate PoC based on a \emph{totally ordered vector structural causal model with subject's covariates}, i.e., ${\boldsymbol Y}:=f_{\boldsymbol Y}({\boldsymbol X},{\boldsymbol C},{\boldsymbol U})$, as a generalization of a scalar structural causal model.
\citep{Hannart2018} discuss multivariate PoC based on the Gaussian vector structural causal model.
It is sometimes difficult to compare the values of multiple variables.
Thus, we introduce \emph{total order} ``$\preceq$'' \citep{Harzheim2005} for exogenous variables both ${\boldsymbol U}$ and outcomes ${\boldsymbol Y}$.
Then, we give an identification theorem of multivariate PoC using conditional CDF with assumptions based on the total order.



Furthermore, we consider more complicated types of  PoC.
First, we introduce the PNS with evidence $({\boldsymbol y}',{\boldsymbol x}',{\boldsymbol c})$, which is realized values of the subject's outcome, treatment, and IV respectively.
Unlike the case of binary outcome and treatment,
we sometimes have the evidence $({\boldsymbol y}',{\boldsymbol x}',{\boldsymbol c})$.
Second, we introduce PNS with multi-hypothetical terms as \citep{Li2022}.
Third, we introduce PNS with multi-hypothetical terms and evidence $({\boldsymbol y}',{\boldsymbol x}',{\boldsymbol c})$ combining with first and second ones.
We give identification theorems using conditional CDF with assumptions based on the total order respectively.
Finally, we show an application of our results to a real-world dataset on education.
We pick up a dataset about student performance in mathematics in secondary education.

\end{comment}