%\documentclass{uai2023} % for initial submission
\documentclass[accepted]{uai2023} % after acceptance, for a revised
                                    % version; also before submission to
                                    % see how the non-anonymous paper
                                    % would look like

%% There is a class option to choose the math font
% \documentclass[mathfont=ptmx]{uai2023} % ptmx math instead of Computer
% Modern (has noticable issues)
% \documentclass[mathfont=newtx]{uai2023} % newtx fonts (improves upon
 % ptmx; less tested, no support)
% NOTE: Only keep *one* line above as appropriate, as it will be replaced
%       automatically for papers to be published. Do not make any other
%       change above this note for an accepted version.

%% Choose your variant of English; be consistent
\usepackage[american]{babel}
% \usepackage[british]{babel}

%% Some suggested packages, as needed:
\usepackage{natbib} % has a nice set of citation styles and commands
    \bibliographystyle{plainnat}
    \renewcommand{\bibsection}{\subsubsection*{References}}
\usepackage{mathtools} % amsmath with fixes and additions
% \usepackage{siunitx} % for proper typesetting of numbers and units
\usepackage{booktabs} % commands to create good-looking tables
\usepackage{tikz} % nice language for creating drawings and diagrams

% for cross referencing the main text
% PLEASE ONLY USE xr IN THE SUPPLEMENTARY MATERIAL. 
% In the main paper, hard code any cross-reference to the supplementary material. 
\usepackage{xr} 
% \externaldocument{uai2023-template}

%*************************
% 自己加的宏包
\usepackage{algorithm}
\usepackage{algpseudocode}
\usepackage{amsmath}
\usepackage{stfloats}
\usepackage{color}
\usepackage{graphicx}
\usepackage{subfigure}
\usepackage{longtable}
\usepackage{makecell}
\usepackage{enumitem}
\usepackage{amssymb}
\usepackage{multirow}
\usepackage{svg} 
\usepackage{amsthm}
\usepackage{diagbox}
\newtheorem{theorem}{Theorem}
\newtheorem{lemma}{Lemma}
%********************************

%% Provided macros
% \smaller: Because the class footnote size is essentially LaTeX's \small,
%           redefining \footnotesize, we provide the original \footnotesize
%           using this macro.
%           (Use only sparingly, e.g., in drawings, as it is quite small.)

%% Self-defined macros
\newcommand{\swap}[3][-]{#3#1#2} % just an example

\title{Practical Privacy-Preserving Gaussian Process Regression via Secret Sharing\\(Supplementary Material)}

% The standard author block has changed for UAI 2023 to provide
% more space for long author lists and allow for complex affiliations
%
% All author information is authomatically removed by the class for the
% anonymous submission version of your paper, so you can already add your
% information below.
%
% Add authors
\author[1,2]{Jinglong Luo}
\author[2]{Yehong Zhang}
\author[2]{Jiaqi Zhang}
\author[2]{Shuang Qin}
\author[2]{Hui Wang}
\author[2]{Yue Yu}
\author[1,2]{Zenglin Xu}
% Add affiliations after the authors
\affil[1]{%
    Harbin Institute of Technology, Shenzhen, China
}
\affil[2]{%
    Peng Cheng Laboratory, Shenzhen, China
}
% \affil[*]{%
%     Correspondence to: Yehong Zhang $<$zhangyh02@pcl.ac.cn$>$ and Zenglin Xu $<$ $>$
% }
  
\begin{document}
  
\onecolumn %% Turn this off if single column is desired for the supplement
\maketitle

\appendix



\section{Correctness and Security analysis of PP-Exp}\label{app:ppexp}

\subsection{Proof of Theorem 1}
In terms of algorithm correctness, we need to consider the problem of overflow or underflow errors during the execution of Algorithm 2. First, the PP-Exp algorithm involves computations on two algebraic structures, namely, the ring of integers $\mathcal{Z}_L$ modulo $L$ and the fixed-point set $\mathcal{Q}_{<\mathcal{Z}_L,l_f>}$. Specifically, the operations on shares are performed on $\mathcal{Z}_{L}$ and others are performed on $\mathcal{Q}_{<\mathcal{Z}_L,l_f>}$. 

\noindent{\textbf{Underflow:}} The PP-Exp has the risk of underflow when calculating $e^{-\check{r}}$ and $e^{\check{d}}$ in Line 5 and Line 13 of Algorithm~2. The minimum value of $e^{-\check{r}}$ and $e^{\check{d}}$ is $e^{u_{min} -\check{r}_{max}}$. To guarantee that PP-Exp will not overflow during computation, we need ensure that $e^{u_{min} -\check{r}_{max}}$ can be expressed as a fixed-point number with precision $l_f$. This is equivalent to $e^{u_{min} -\check{r}_{max}} \cdot 2^{l_f}\geq1$. Therefore, $l_f\geq (\check{r}_{max} - u_{min})\log_2^e$ is needed to ensure that PP-Exp does not underflow during the calculation.

\noindent{\textbf{Overflow:}}
PP-Exp has the potential to overflow when computing $[e^u]_j$ for $j \in \{0,1\}$ in Line 14. Since $[e^u]_j$ is the share of $e^u$, the maximum value of the reconstruction process is
$$
[e^{u_{max}}]_0 +  [e^{u_{max}}]_1 = e^{u_{max} + \check{r}}\cdot 2^{l_f} \cdot ([e^{-\check{r}}]_0 + [e^{-\check{r}}]_1) = e^{u_{max}} \cdot 2^{2l_f}
$$
according to Line 14 of Algorithm 2. Therefore, $ e^{u_{max}} \cdot 2^{2l_f} \leq 2^{l-1}$ is required to ensure that the reconstruction process does not overflow. As $u<0$ (i.e., $u_{max} = 0$), it is achieved that PP-Exp does not overflow if $l_f< \frac{l-1}{2}$.
%$e^{u}$ is less than 1. Therefore, to ensure that PP-Exp does not overflow requires $l_f< \frac{l-1}{2}$. 

\subsection{Proof of Theorem 2}
In terms of algorithm security, we analyze the probability of PP-Exp privacy leakage and the \textit{expected degree of information leakage}.
The number of possible values of $u$ is $m_u$ and the number of possible values of $r$ is $m_r$, then the probability that the algorithm leads to additional privacy leakage of input $u$ is
\begin{equation}
    \frac{(1+2+\cdots+m_u)\cdot2}{m_um_r}=\frac{m_u\cdot (m_u-1)}{m_um_r}=\frac{m_u-1}{m_r}.
\end{equation}
The probability that the PP-Exp is \emph{secure} is $1 - \frac{m_u-1}{m_r} = \frac{m_r-m_u+1}{m_r}$.

Since the exponents are all negative in the Gaussian process regression, the number of $u$ values is reduced by half. This allows the probability of input $u$ leakage to be further reduced. For example, when $l_f = 2^{29}$, supposing the input $u$ takes values in the range $[-4,0]$ and $r$ takes values in the range $[-16,16)$ (i.e., the number of values of $u$ and $r$ are $2^{31}+1$ and $2^{34}+1$, respectively), the probability that the algorithm to be secure is 
$\frac{2^{34}-2^{31}}{2^{34}} =\frac{7}{8}$.

The \emph{expected degree of information leakage} is used to describe the amount of privacy leakage of the PP-Exp algorithm. It can be understood by the following simple example. Suppose $u \in \{-2,-1,0\}, r \in \{-2,-1,0, 1, 2\}$ and $d = u+r \in \{-4, -3, -2,-1, 0, 1, 2\}$. The probabilities of $d$ taking different values are as follows: 
\begin{equation}
\begin{array}{cc}
     &  \Pr\{d = -4\} = \Pr\{d = 2\} = \frac{1}{15},\\
    & \Pr\{d = -3\} = \Pr\{d = 1\} = \frac{2}{15},\\
   &  \Pr\{d = -2\} = \Pr\{d = -1\} = \Pr\{d = 0\}=\frac{3}{15}.   
\end{array}  
\end{equation}
If $d\in \{-2, -1, 0\}$, $u$ can take any of $-2, -1, 0$ such that no additional information about $u$ is revealed by $d$. Therefore, the degree of information leakage of $u$ is $\frac{3}{15}\cdot \frac{1}{3} = \frac{1}{5}$. If $d\in \{-4, 2\}$, $u$ will only correspond to unique values, and the degree of information leakage of $u$ is $2\cdot \frac{1}{15}\cdot 1$.  If $d\in \{-3, 1\}$, there are two possible values of $u$. The degree of information leakage of $u$ is $2 \cdot\frac{2}{15} \cdot \frac{1}{2} = \frac{2}{15}$. Therefore,  the \textit{degree of information leakage} of $u$ averaged over $d \in \{-4, -3, 1, 2\}$ is $\frac{2}{15} + \frac{2}{15} + \frac{1}{5} = \frac{7}{15}$. 

In PP-Exp, there are $\frac{2k}{m_um_r}, k = 1,2,\cdots, m_u-1$ probabilities that $m_u-k$ possible values of the input $u$ are excluded by $d$. Therefore, the \textit{average degree of information leakage} of input $u$ is 
\begin{equation}
(1-\frac{m_u -1}{m_r})\cdot\frac{1}{m_u} + \sum_{k = 1}^{m_u-1} (\frac{2k}{m_um_r})\cdot\frac{1}{k} = \frac{m_r -m_u +1}{m_um_r} + \frac{2(m_u-1)}{m_um_r} = \frac{m_u+m_r-1}{m_u\cdot m_r}.
\end{equation}
It is important to note that the PP-Exp algorithm only exposes the exact value of $u$ when the maximum and minimum values are in the range where $u$ and $r$ are taken simultaneously. This probability is 
\begin{equation}
    \frac{2}{mn}=\frac{2}{(2^{31}+1)\cdot (2^{34}+1)}<\frac{2}{2^{31}\cdot 2^{34}}=\frac{1}{2^{64}}.
\end{equation} 

% Suppose the number of possible values of $u$ is $m$ and the number of possible values of $r$ is $n$. There are $\frac{2k}{mn}, k = 1,2,\cdots, m-1$ probabilities that $m-k$ possible values of the input $u$ are excluded by $d$. Therefore, the number of possible values of input $u$ that the PP-Exp algorithm leaks is
% \begin{equation}
%     \frac{2((m-1)+2\cdot (m-2)+\cdots + (m-1))}{mn}=\frac{m^2-1}{3n},
% \end{equation}
% which is $\frac{m-\frac{1}{m}}{3n}$ of all possible values of $u$. For example, when $l_f = 29$ , assume that the input $u$ takes values in the range $[-4,0]$ and $r$ takes values in the range $[-16,16]$. PP-Exp gives away the number of possible values of input $u$ as $2^{26.4}$. This is approximately all possible values of $u$ for $\frac{1}{24}$.

\begin{algorithm}[b]\footnotesize
	\caption{Privacy-preserving Cholesky decomposition}
	\label{alg2}
    \textbf{Setup.} The servers determine an integer ring $\mathcal{Z}_L$.\\
    \textbf{Input.} 
 $S_0$ holds the share $[\mathbf{U}]_0$; $S_1$ hold the share $[\mathbf{U}]_1$.
	\begin{algorithmic}[1]
		\State 	// \textbf{Offline phase:}
		\State $T$ generates $\mathbf{A}, \mathbf{B} \in Z^{n\times n}_L$ randomly, and calculates $\mathbf{C} = \mathbf{AB}$;
		\State $T$ sends $([\mathbf{A}]_j, [\mathbf{B}]_j,[\mathbf{C}]_j)$ to $S_j$.
		\State // \textbf{Online phase:}
		\For{$j \in \{0,1\}$}:
	        \State $[d_1]_j \leftarrow [u_{1,1}]_j$;

    	    \State $S_j$ calculates $[l_{1:n,1}]_j = [u_{1:n, 1}]_j / [u_{1,1}]_j$ by calling PP-Div;
    		\For{$k \in \{2,3,\ldots,n\}$}:
                \State $S_j$ calculates $[d_k]_j = [u_{k,k}]_j - [\sum_{m = 1}^{k-1}l^{2}_{k,m}d_m]_j$ by calling  PP-MM;
                %\State $S_j$ calculates $[g_{k}]_j = [\sum_{m = 1}^{k-1}l_{k,m}l_{k,m}d_m]_j$ by calling  PPMM;
    	        %\For{$h \in \{k+1,k+2,\ldots,n\}$}:
                \State $S_j$ calculates $[\hat{l}_{k+1:n,k}]_j =  [u_{k+1:n,k}]_j - [\sum_{m = 1}^{k-1}l_{k,m}d_ml_{k+1:n,m}]_j$ by calling  PP-MM;
                \State $S_j$ calculates $[l_{k+1:n,k}]_j = [\hat{l}_{k+1:n,k}]_j / [d_k]_j$ by calling  PP-Div;
                %\EndFor
            \EndFor
	    \EndFor
        \Comment $S_j$ gets $[\mathbf{L}]_j, [\mathbf{D}]_j$.
	\end{algorithmic}
   \textbf{Output.} 
	$S_j$ output the share $[L]_j, [D]_j$ for $j \in \{0,1\}$.
\end{algorithm}

\section{Privacy-preserving matrix inversion}\label{app:ppmi}
In this section, we will give the construction details of the \emph{privacy-preserving matrix inversion} (PP-MI) operation.
The main idea of the PP-MI is to transform the matrix inversion process into MPC-friendly operations such as multiplication and division.
%
Next, we will first briefly recall the process of positive definite matrix inversion via Cholesky decomposition and then provide the 
%The specific content includes an introduction to the Cholesky decomposition and the
pseudo-code of the PP-MI algorithm. 

%The main idea of the PP-MI is to transform the inversion process into MPC-friendly operations such as multiplication and division.

%Specifically for the positive definite symmetric matrix
Let $\mathbf{U} = (u_{h, k})_{h, k = 1, 2, \ldots, n}$ be an $n\times n$ positive definite matrix. It can be decomposed as $\mathbf{U} = \mathbf{LDL}^\top$ where $\mathbf{L}$ is a unit lower triangular matrix and $\mathbf{D}$ is a diagonal matrix.
%
Let $d_k$ be the $k^{th}$ diagonal element of $\mathbf{D}$ for $k = 1, \ldots, n$. 
We can calculate $\mathbf{L},\mathbf{D}$ through \eqref{eq4}. 
For $k = 1,2,\dots, n$ and $h = k+1, k+2, \dots, n$.
\begin{equation}
\label{eq4}
\left\{
\begin{aligned}
& d_k = u_{k,k} - \sum_{m = 1}^{k-1}l^{2}_{k,m}\cdot d_m\ , \\
& l_{h,k} = (u_{h,k} - \sum_{m = 1}^{k-1}l_{h,m}\cdot l_{k,m}\cdot d_m)/ d_k\ .
\end{aligned}
\right.
\end{equation} 

Supposing $\mathbf{U\Lambda} = \mathbf{I}$ for a given $n \times n$ identity matrix $\mathbf{I}$. Next, we will introduce how to compute $\mathbf{\Lambda}$ from $\mathbf{I, L}$, and $\mathbf{D}$. Then, we can have $\mathbf{U}^{-1} = \mathbf{\Lambda}$.  

Let $\mathbf{V} \triangleq \mathbf{DL}^\top\mathbf{\Lambda}$. Then, $\mathbf{U\Lambda} = (\mathbf{LDL}^\top)\mathbf{\Lambda} = \mathbf{LV} = \mathbf{I}$ and $\mathbf{V}$ is a unit lower triangular matrix. For $k = 1, 2,\dots, n$ and $h = k+1, k+2,\dots, n$, we can calculate each element of $\mathbf{V}$ as:
\begin{equation}
\label{eq5}
%\left\{
% \begin{aligned}
% & v_{1,k} = b_{1,k}, \\
v_{h,k} = -\sum_{m=1}^{h-1}v_{m,k}\cdot l_{h,m}.
%\end{aligned}
%\right.
\end{equation} 

we can calculate the matrices $\mathbf{\Lambda}$ using \eqref{eq6}:
\begin{equation}
\label{eq6}
\mathbf{\Lambda} = \mathbf{U}^{-1} = (\mathbf{(LDL^\top)})^{-1} = (\mathbf{L}^{-1})^\top \mathbf{D}^{-1} \mathbf{L}^{-1} = \mathbf{V}^\top \mathbf{D}^{-1} \mathbf{V}.
\end{equation} 


% Let $\mathbf{\hat{V}} \triangleq \mathbf{D^{-1}V} = \mathbf{L}^\top \mathbf{\Lambda}$. For $k = 1, 2,\dots, n$ and $h = n-1, n-2,\dots,1$, we can calculate the matrices $\mathbf{\Lambda}$ using \eqref{eq6}:
% \begin{equation}
% \label{eq6}
% \left\{
% \begin{aligned}
% & \lambda_{n,k} = \hat{v}_{n,k}, \\
% & \lambda_{h,k} = \hat{v}_{h,k} -\sum_{m=h+1}^{n}\lambda_{m,k}\cdot l_{h,m}.
% \end{aligned}
% \right.
% \end{equation} 

Based on equations \eqref{eq4}, \eqref{eq5}, and \eqref{eq6}, we can implement PP-MI by reasonably invoking privacy-preserving multiplication (PP-MM) and privacy-preserving division (PP-Div) in \cite{knott2021crypten}. Let $\mathbf{U}$ be the an $n \times n$ positive definite matrix, $u_{a;b, k:h} \triangleq (u_{i, j})_{i=a, \ldots, b}^{j = k, \ldots, h}$. The PP-MI algorithm for $\mathbf{U}$ consists of three parts, and their pseudo-codes refer to the Algorithms~\ref{alg2}-\ref{alg4}. Note that the pseudo-codes are note exactly the same as \eqref{eq4}-\eqref{eq6} since we vectorize some steps for accelerating the computation. 

% \begin{algorithm}[t]\footnotesize
% 	\caption{Privacy-preserving forward}
% 	\label{alg3}
% 	\begin{algorithmic}[1]
% 		\Require
% 		$S_0$ holds the share $[U]_0, [L]_0$; $S_1$ hold the share $[U]_1,[L]_1$.
% 		\Ensure
% 		$S_0$ gets the share $[V]_0$, $S_1$ gets the share $[V]_1$.
% 		\State 	\textbf{Offline phase:}
% 		\State $T$ generates $a, b, c = ab$;
% 		\State $T$ sends $([a]_j, [b]_j, [c]_j)$ to $S_j$.
% 		\State \textbf{Online phase:}
% 		\For{$k \in \{1, 2,\dots,n\}$}
% 		    \For{$h \in \{k+1,k+2,\dots,n\}$}
%                 \State $S_j$ calculates $[v_{h,k}]_j =  - [\sum_{m = 1}^{h-1}v_{h,k}l_{h,m}]_j$ by calling  PP-MM;
%             \EndFor
%         \EndFor
%         \Comment $S_j$ gets $[V]_j$.
% 	\end{algorithmic}
% \end{algorithm}

\begin{algorithm}[t]\footnotesize
	\caption{Privacy-preserving forward}
	\label{alg3}
       \textbf{Setup.} The servers determine an integer ring $\mathcal{Z}_L$.\\
	\textbf{Input.}
		$S_0$ holds the share $[\mathbf{U}]_0, [\mathbf{L}]_0$; $S_1$ hold the share $[\mathbf{U}]_1,[\mathbf{L}]_1$.
	\begin{algorithmic}[1]
		\State 	// \textbf{Offline phase:}
		\State $T$ generates $\mathbf{A}, \mathbf{B} \in Z^{n\times n}_L$ randomly, and calculates $\mathbf{C} = \mathbf{AB}$;
		\State $T$ sends $([\mathbf{A}]_j, [\mathbf{B}]_j,[\mathbf{C}]_j)$ to $S_j$.
		\State // \textbf{Online phase:}
		\State $v_{1, 1} \leftarrow 1$
		\For{$j \in \{0,1\}$}
% 		\For{$k \in \{1, 2,\dots,n\}$}
		    \For{$k \in \{2,3,\dots,n\}$}
                \State $S_j$ calculates $[v_{k,1:k-1}]_j =  - [\sum_{m = 1}^{k-1}v_{m,1:k-1}l_{k,m}]_j$ by calling  PP-MM;
            \EndFor
        \EndFor
        \Comment $S_j$ gets $[\mathbf{V}]_j$.
	\end{algorithmic}
       \textbf{Output.}
	$S_j$ outputs the share $[\mathbf{V}]_j$ for $j\in \{0,1\}$.
\end{algorithm}

% \begin{algorithm}[t]\footnotesize
% 	\caption{Privacy-preserving forward}
% 	\label{alg3}
% 	\begin{algorithmic}[1]
% 		\Require
% 		$S_0$ holds the share $[U]_0, [L]_0$; $S_1$ hold the share $[U]_1,[L]_1$.
% 		\Ensure
% 		$S_0$ gets the share $[V]_0$, $S_1$ gets the share $[V]_1$.
% 		\State 	\textbf{Offline phase:}
% 		\State $T$ generates $a, b, c = ab$;
% 		\State $T$ sends $([a]_j, [b]_j, [c]_j)$ to $S_j$.
% 		\State \textbf{Online phase:}
% 		\For{$k \in \{1, 2,\dots,n\}$}
% 		    \For{$h \in \{k+1,k+2,\dots,n\}$}
%                 \State $S_j$ calculates $[v_{h,k}]_j =  - [\sum_{m = 1}^{h-1}v_{m,k}l_{h,m}]_j$ by calling  PPMM;
%             \EndFor
%         \EndFor
%         \Comment $S_j$ gets $[V]_j$.
% 	\end{algorithmic}
% \end{algorithm}

% \begin{algorithm}[t]\footnotesize
% 	\caption{Privacy-preserving backward}
% 	\label{alg4}
% 	\begin{algorithmic}[1]
% 		\Require
% 		$S_0$ holds the share $[V]_0, [L]_0, [D]_0$; $S_1$ hold the share $[V]_1,[L]_1, [D]_1$.
% 		\Ensure
% 		$S_0$ gets the share $[\Lambda]_0$, $S_1$ gets the share $[\Lambda]_1$.\\
% 	    \textbf{Offline phase:}
% 	    \State $T$ generates $a, b, c = ab$;
% 		\State $T$ sends $([a]_j, [b]_j, [c]_j)$ to $S_j$.
% 		\State \textbf{Online phase:}
%         \State $S_j$ takes $[V]_j, [D]_j$ as input to calculate $[\hat{V}]_j = [D^{-1}]_j[V]_j =  [V]_j / [D]_j$ by calling  PPDiv;
% 		\For{$k \in \{1, 2,\dots,n\}$}
% 		    % \For{$j \in \{0, 1\}$}
% 		    \State $[\lambda_{n,k}]_j \leftarrow [\hat{v}_{n,k}]_j$;
% 		    %\EndFor
% 		    \For{$h \in \{n-1,n-2,\dots,1\}$}
%                 \State $S_j$ takes $[\lambda_{n,k}]_j, [l_{k, h}]_j$ as input to calculate $[\lambda_{h,k}]_j =  [\hat{v}_{h,k}]_j - [\sum_{m = h+1}^{n}\lambda_{m,k}l_{h,m}]_j$ by calling  PPMM;
%             \EndFor
%         \EndFor
%         \Comment $S_j$ gets $[\Lambda]_j$.
% 	\end{algorithmic}
% \end{algorithm}

\begin{algorithm}[t]\footnotesize
	\caption{Privacy-preserving backward}
	\label{alg4}
     \textbf{Setup.} The servers determine an integer ring $\mathcal{Z}_L$.\\
	 \textbf{Input.}
		$S_0$ holds the share $[\mathbf{V}]_0, [\mathbf{L}]_0, [\mathbf{D}]_0$; $S_1$ hold the share $[\mathbf{V}]_1,[\mathbf{L}]_1, [\mathbf{D}]_1$.
	\begin{algorithmic}[1]
	    \State // \textbf{Offline phase:}
		\State $T$ generates $\mathbf{A}, \mathbf{B} \in Z^{n\times n}_L$ randomly, and calculates $\mathbf{C} = \mathbf{AB}$;
		\State $T$ sends $([\mathbf{A}]_j, [\mathbf{B}]_j,[\mathbf{C}]_j)$ to $S_j$.
		\State // \textbf{Online phase:}
          \State $S_j$ calculates $[\mathbf{\Lambda}]_j = [\mathbf{V}^\top]_j [\mathbf{D}^{-1}]_j [\mathbf{V}]_j$ by calling PP-Div and PP-MM.
        \Comment $S_j$ gets $[\mathbf{\Lambda}]_j$.
	\end{algorithmic}
     \textbf{Output.}
		$S_j$ outputs the share $[\mathbf{\Lambda}]_j$ for $j \in \{0,1\}$.
\end{algorithm}


%The PPMI is implemented by calling Privacy Preserving Matrix Multiplication(PPMM), which is proposed in \cite{SecureNN} by using matrix Beaver-triples\cite{Beavertriples}. In PPMM, for all $j \in \{0, 1\}$ the computing servers $S_j$ take the additive share $[X]_j, [Y]_j$ of matrix $X \in Z^{m \times n}_{2^l}, Y \in Z^{n \times s}_{2^l}$ as inputs, respectively. Then they interactive computing the additive shares of $X\cdot Y \in Z^{m \times s}_{2^l} $ privately with matrix Beaver-triples $(A,B,C)$, generated by assistant server $T$, for $A\in Z^{m \times n}_{2^l}, B \in Z^{n \times s}_{2^l}, C = A\times B$. Specifically, for $j \in \{0, 1\}$, $T$ sends $([A]_j, [B]_j, [C]_j)$ to $S_j$. The computing server $S_j$ calculates $[D]_j = [X]_j-[A]_j$, $[E]_j = [Y]_j - [B]_j$, and send them to each other. Then the computing server $S_j$ reveals $D= [D]_0+[D]_1, E = [E]_0+[E]_1$ and gets $[X\cdot Y]_j = -jD \cdot E +[X]_j \cdot E + D \cdot [Y]_j + [C]_j$.
% In PPMI, for all $j \in \{0, 1\}$ the computing servers $S_j$ take the additive share $[X]_j$ of matrix $X \in Z^{k \times k}_{2^l} $ as inputs, respectively. Then they deduce the shares of $X^{-1}$ privately with matrix Beaver-triples generated by $T$.  The specific structure of the PPMI is shown in algorithm \ref{alg1} and it satisfies Theorem \ref{theorem1}.

%% Security proof
\section{Communication complexity analysis}\label{app:proof}
% In this section, we first analyze the communication complexity of PP-Exp and PP-MI.  Then, we prove the correctness and security of them under the semi-honest model using \textit{simulation-based} method. 

% \subsection{Theoretical communication rounds and complexity}
In this section, we will analyze the communication complexity of PP-MI step by step. We only analyze the communication complexity in the online phase. The process of the offline phase can be done by the server during idle time. 

%The analysis results are summarized in Table~\ref{table2}.

% For PP-Exp, the computing server only needs to spend one round of communication in the online phase for reconstructing $u-r$ (i.e., line 7 in Algorithm~\ref{alg-exp}) and the amount of communication is $2l$. By utilizing the parallel computing ability of the computer, PP-Exp can complete the computation of a vector containing $n$ elements in the case of one round of communication whose communication volume is $2nl$.

For PP-MI, we will analyze the communication cost for Algorithms~\ref{alg2},~\ref{alg3}~and~\ref{alg4}, separately. For Algorithm~\ref{alg2}, there is $1$ vector-wise division in line 7. For each $k = 2, ..., n-1$, there are $4$ rounds of vector-wise multiplication for lines 9-10 and $1$ vector-wise division for line 11. When $k = n$, there are only $2$ rounds of vector-wise multiplication for line 9. We do not need to compute lines 10-11 when $k = n$. There are $17$ rounds of communication for each element/vector-wise PP-Div and $1$ round of communication for each element/vector-wise PP-MM. Consequently, the total communication round for Algorithm~\ref{alg2} is $17 + (n-2)*(4+17) + 2 = 21n-23$.
%
%In lines 10-15 of Algorithm~\ref{alg2}, there are $n-1$ rounds of operations.  The first $n-2$ rounds of operations consist of $4$ multiplications and $1$ divisions and the $n-1$ round consists of $2$ multiplications. Therefore, the total number of communication rounds is $21n-23$.
Since there are $\mathcal{O}(n^3)$ element-wise multiplications and $\mathcal{O}(n^2)$ element-wise divisions included in Algorithm~\ref{alg2}, its communication complexity is $\mathcal{O}(n^{3}l)$.
%these steps. Therefore, executing a complete privacy-preserving matrix decomposition requires $21n -6$ rounds of communication and the communication complexity is $\mathcal{O}(n^{3}l)$. 

For Algorithm~\ref{alg3}, it includes $1$ vector-wise multiplication for each $k = 2, \ldots, n$
%In steps 2 of the privacy-preserving forward algorithm, it includes $n-1$ rounds of calculation
and each round of computation (i.e., line 8 for each $k$) consists of $(k-1)^2$ element-wise multiplications. Therefore,
%to implement the privacy-preserving forward algorithm,
Algorithm~\ref{alg3} incurs a total of $n-1$ rounds of communication and its communication complexity is $\mathcal{O}(n^{3}l)$.

For Algorithm~\ref{alg4}, it contains $1$ matrix division and 1 matrix multiplication with $n^2$ elements in line 5. Therefore, the total number of communication rounds required to execute Algorithm~\ref{alg4} is $17+1=18$. The communication complexity is $\mathcal{O}(n^{2}l)$.

In summary, the PP-MI algorithm incurs $21n-23+n-1+18=22n-6$ rounds of communication and the communication complexity is $\mathcal{O}(n^{3}l)$.

%Compared with the privacy-preserving forward algorithm, the privacy-preserving backward algorithm additionally includes $1$ times of $n^2$ element divisions in the first step. Therefore, the total number of communication rounds required to execute the privacy protection backward is $n+16$, and the total communication volume is also $\mathcal{O}(n^{3}l)$. In summary, it takes a total of $23n+9$ rounds of communication to execute the PP-MI algorithm, and the communication complexity is $\mathcal{O}(n^{3}l)$.

% \begin{table}[t]
% 	\footnotesize
% 	\centering
% 	\caption{Communication rounds \& communication  complexity.}
% 	\begin{tabular}{cccc}
% 		\toprule
% 		Algorithms  & Input size & Communication rounds &  Communication  complexity \\
% 		\midrule
% 		PP-Exp  &$n$   & $1$  &$2nl$     \\
% 		PP-MI  & $n\times n$  & $23n-8$  & $\mathcal{O}(n^3l)$      \\

% 		\bottomrule
% 	\end{tabular}
	
% 	\footnotesize{$l=64$ is the data length; $n$ is the dimension of the matrix or vector.}
% 	\label{table2}
% \end{table}

% \subsection{Security proof}
% In this section, we will use the simulation-based approach to formalize security proofs for the PP-Exp and PP-MI algorithms under the semi-honest model. We first introduce the definition of the \textit{simulation-based} security as shown in \cite{Lindell2017simulate}.

% \begin{itemize}[leftmargin=*,itemsep=5pt,topsep=0pt,parsep=0pt,partopsep=0pt]
% 	\item Let $f = (f_0, f_1)$ represent an operation where $f_i$ is the function executed on Server $S_i$ for $i \in \{0, 1\}$, $\pi$ be the privacy-preserving algorithm for computing $f$, $x_i$ be the input of $S_i$, and $f_i(x_0,x_1)$ be the output of $S_i$ on the inputs $(x_0,x_1)$.
% 	\item The \emph{view} of $S_i$ for $i \in \{0,1\}$ during the execution of $\pi$ on inputs $(x_0,x_1)$ is denoted as $view_i^{\pi}(x_0,x_1)$. Specifically, $view_i^{\pi}(x_0,x1) = (x_i, r_i; m_{i1}, \ldots, m_{it})$ where $r_i$ equals the contents of the $S_i$’s internal random tape and $m_{ij}$ for $j=1, 2, \ldots, t$ represents the $j$-th message received by $S_i$ during the execution of the privacy-preserving algorithm.
% \end{itemize}

% \newtheorem{definition}{Definition}

% \begin{definition}
% 	We say that $\pi$ is secure against \textit{semi-honest } adversaries $\mathcal{A}$ if there is a probabilistic polynomial time simulator (PPT) $Sim$ that can simulate the internal state of $S_0$ and $S_1$ by calculating the servers' inputs, outputs, and some public parameters $\lambda$ such that
% 	\begin{equation*}
% 		\begin{split}
% 			\{Sim_0(x_0, f_0(x_0,x_1), \lambda\}_{x_0,x_1\in Z_L}\equiv \{view^{\pi}_0(x_0,x_1)\}_{x_0,x_1\in Z_L};\\
% 			\{Sim_1(x_1, f_1(x_0,x_1), \lambda\}_{x_0,x_1\in Z_L}\equiv \{view^{\pi}_1(x_0,x_1)\}_{x_0,x_1\in Z_L}.
% 		\end{split}
% 	\end{equation*}
% \end{definition}
% The ``$\equiv$'' in the equation means computationally indistinguishable (i.e., $\mathcal{A}$ cannot distinguish between the simulated view and the real view.). See Section 2 of~\cite{Lindell2017simulate} for its formal definition.
% %
% During the execution of the operations, it is assumed that one of the servers is controlled by the \textit{semi-honest} adversary $\mathcal{A}$. This means that $\mathcal{A}$ obtains the internal state of the server during the execution of the operations.
% %$\mathcal{A}$ cannot distinguish between the simulated view and the real view.

% Note that during the execution of the privacy-preserving algorithms, $T$ only generates random numbers that meet the public conditions in the offline phase. Therefore, there is a PPT simulator $Sim_T$ which can randomly generate corresponding random numbers to simulate $T$ according to public information, making the semi-honest adversary $\mathcal{A}$ cannot distinguish between the simulated view and the real view.

% Next, we will show the security of PP-Exp and PP-MI as declared in Theorems~\ref{theorem-exp}~and~\ref{theorem-mi}.
% Firstly, the following lemma is needed, which has been widely used in existing privacy-preserving ML works \cite{Mohassel2017SecureML,Wagh2019SecureNN}.
% %for showing the security of PP-Exp.
% %
% \begin{lemma}
% 	\label{lemma1}
% % 	If $a$ and $b$ are uniformly distributed in $Z_L$ and $a$ is independent of $b$, then $a+b$ is also uniformly distributed in $Z_L$ and independent of $a$.
% 	For any $a$ in $Z_L$, if $b$ is uniformly distributed in $Z_L$ and independent of $a$, then $a+b$ is also uniformly distributed in $Z_L$ and independent of $a$.
% \end{lemma}

% %The proof of Lemma~\ref{lemma1} can be found in Section xxx of~\cite{TODO}.

% %\noindent \textbf{Proof} 
% %\begin{proof}
% Theorem~\ref{theorem-exp} holds if and only if the PPExp algorithm satisfies the correctness and security requirements under the semi-honest model defined in this paper. The correctness of the PPExp can be proved through the following equation:
% %
% \begin{equation}	
% 	\begin{split}
% 		&[e^u]_0 + [e^u]_1= e^d \times [e^r]_0 + e^d \times [e^r]_1 \\
% 		& = e^d \times ([e^r]_0 + [e^r]_1) = e^d \times e^r \\
% 		& = e^{u-r} \times e^r = e^u.
% 	\end{split}
% \end{equation}
% %
% In terms of security, we need to prove that during the execution of the PP-Exp, the view of each computing server is simulatable. Specifically, we assume that a computing server is controlled by the adversary $\mathcal{A}$ during the execution of the PP-Exp. Without loss of generality, we assume that $\mathcal{A}$ controls $S_1$.

% The view of $S_1$ is $\{view^{PP-Exp}_1(u_0,u_1)\}_{u_0,u_1\in z_L} =  \{[u]_1, [r]_1, [e^r]_1, d\}$ where $[r]_1$ and $[e^r]_1$ are random values generated by $T$. Therefore, it does not contain any information of the privacy data $u$. According to Lemma~\ref{lemma1}, $d = u - r$ is uniformly distributed in $Z_{L}$ and is independent of $u$ since $r$ is uniformly selected in $z_{L}$.
% %
% Therefore, there is a PPT simulator $Sim_0$ that can perfectly simulate the view of $S_0$ by knowing the inputs and outputs of $S_0$. In the same way, it can be proved that the view of $S_0$ is simulatable during the execution of the PP-Exp.

% %\end{proof}

% %\noindent \textbf{Proof} 
% Theorem \ref{theorem-mi} holds if and only if the PP-MI algorithm satisfies the correctness and security requirements under the semi-honest model defined in this paper. 

% The correctness and security of PP-MI depend on the correctness and security of PP-MM and PP-Div algorithms. For the detailed proof process, please refer to \cite{knott2021crypten}. By using general composability framework, the security of Theorem \ref{theorem-mi} can be achieved.
%\hfill \blacksquare
%%%%%%%%%%%%%%%%%%%%%%%%%%
\section{Additional Experimental Results (PP-GPR vs. DP-GPR)}

Another technique that has been widely used to achieve privacy preservation in machine learning is \emph{differential-privacy}~(DP)~\citep{dwork2014algorithmic}.
By adding noise that satisfies the privacy budget, DP-based privacy-preserving machine learning algorithms achieve some privacy protection at the expense of the accuracy of the model. In the domain of Gaussian process regression (GPR) algorithms,~\citet{smith2018differentially} proposed a significant contribution by presenting the first privacy-preserving GPR algorithm. However, it is important to note that their approach only safeguards the privacy of the model outputs $\mathbf{y}$ through the addition of noise levels that adhere to a given privacy budget. Consequently, their algorithm falls short of achieving complete protection for both the model inputs and outputs.

To demonstrate the effectiveness of our algorithm, we present the loss of predictive mean of DP-GPR on the diabetes dataset for different levels of DP guarantee (i.e., varying $\epsilon$) using the SE kernel. As can be seen, even with a large privacy budget (i.e., $\epsilon = 1.0$), DP-GPR incurs significantly larger computational errors due to the additional DP noise compared to the proposed PP-GPR. The $Loss_{\mu}$ of our proposed algorithm is kept below $0.01\%$ (Table 2 in the main paper). 

\begin{table*} [h]\footnotesize
\centering
\caption{Comparison of our algorithm with the DP-based algorithm in the relative difference $Loss_{\mu}$.}
\begin{tabular}{c|c|c|c|c|c}
\toprule
          $n$  & Test & $\epsilon = 1.0$  & $\epsilon = 0.5$  & $\epsilon = 0.2$ & \textbf{Ours}  \\ 
          
\hline 
          80        &  20  &           $18.6\%(\pm23.4)$ & $42.9\%(\pm188.3)$ & $96.6\%(\pm584.8)$  & $0.0007\%(\pm3.6e-04)$ \\
 150     & 50   & $27.8\%(\pm19.4)$ & $61.2\%(\pm311.6)$ & $148.6\%(\pm1457.7)$ & $0.0018\%(\pm1.3e-03)$ \\
           300      & 100      &$18.3\%(\pm10.5)$ & $33.5\%(\pm23.7)$  & $80.6\%(\pm133.3)$&$0.0058\%(\pm2.5e-03)$  \\
\bottomrule
\end{tabular}\label{tab:dp-gp}
\end{table*}
%%%%%%%%%%%%%%%%%%%%%%

\bibliography{luo_732}

\end{document}
