
\section{Our Data Structures}\label{sec:data}

Our algorithm's high-level idea is the following. We apply importance sampling (Definition~\ref{def:importance_sampling}) to approximate kernel density at a query point $q$ efficiently. We want to sample data points with probability according to their contribution to the estimation. Ideally, given query point $q \in \R^d$ and data set $X \subset \R^d$, we can sample each data point in $X$ with probability proportional to the inverse of the distance from query $q$. Unfortunately, we have no access to query points when preprocessing. Hence we make use of LSH (Definition~\ref{def:LSH_family}) to overcome this problem.
In general, given a query $q$, LSH can recover its near points with high probability while the probability of recovering far points is bounded by a small quantity. To apply LSH, we first run 
% \begin{align*}
$R=\lceil \frac{1}{f_{\mathsf{KDE}}} \rceil$
% \end{align*}
rounds sampling, in which we sample each data point with probability $\frac{1}{2^r n f_{\mathsf{KDE}}}$ in $r$-th round. Then we obtain $R$ subsampled data sets. Given query $q$, we use LSH to recover those points both in level set $L_r$ and the $r$-th subsampled data sets. Hence we get the sampled data points and the corresponding sampling rates (in other words ``importance''). Then we construct the estimator as in Definition~\ref{def:importance_sampling}. 
Finally, we repeat the estimation process independently and take the median to get $(1 \pm \epsilon)$ approximation with high probability.

\subsection{LSH Data Structure}\label{app:data_structure_lsh_formal}

In this section, we present our LSH data structure with the following procedures:

{\bf Initialize.} Given a data set $\{x_1, \cdots, x_n\} \in \R^d$ and integral parameters $k, L$, it first invokes private procedure \textsc{ChooseHashFunc}. The idea behind this is to amplify the ``sensitivity'' of hashing by concatenating $k$ basic hashing functions from the family $\mathcal{H}$ (Algorithm~\ref{alg:LSH_public_app} line~\ref{lin:basic_hash_family}) into new functions. Thus we obtain a family of ``augmented'' hash function $\mathcal{H}_l, l \in [L]$ (Algorithm~\ref{alg:LSH_private_app} line~\ref{lin:LSH_sample_k_functions}). We follow by \textsc{ConstructHashTable} in which we hash each point $x_i$ using the hashing function $\mathcal{H}_{l}$. Then we obtain $L$ hash tables corresponding to $L$ hash functions which can be updated quickly.

{\bf Recover.} Given a query $q \in \R^d$, it finds the bucket where $q$ is hashed by $\mathcal{H}_l$ and retrieves all the points in the bucket according to hashtable $\mathcal{T}_l$. This operation applies to all $L$ hashtables.

{\bf UpdateHashTable.} Given a new data point $z \in \R^d$ and index $i \in [n]$, it repeats following operations for all $l \in [L]$: find bucket $\mathcal{H}_l(z)$ and insert point $z$; find bucket $\mathcal{H}_l(x_i)$ and delete point $x_i$.

Note that traditional LSH data structure only has \textsc{Initialize} and \textsc{Recover} procedures. To make it a dynamic structure, we exploit its hash storage. We design \textsc{UpdateHashTable} procedure so that we can update the hash table on the fly. This procedure provides guarantee for dynamic kernel density estimation.

\subsection{Initialize Part of Data Structure}


\begin{algorithm}[!ht]\caption{LSH, private procedures}\label{alg:LSH_private_app}
\begin{algorithmic}[1]
\State {\bf data structure} \textsc{LSH}
\State
\State {\bf private}
\Procedure{\textsc{ChooseHashFunc}}{$k,L\in \mathbb{N}_+$}\label{lin:choose_hash_func}
    \For{$l \in [L]$}
        \State \Comment{Amplify hash functions by concatenating}
        \State $\mathcal{H}_{l} \leftarrow$ sample $k$ hash functions $(f_{1,l},f_{2,l},\cdots,f_{k,l})$ from $\mathcal{H}$ \label{lin:LSH_sample_k_functions}
    \EndFor
\EndProcedure

\State
\Procedure{\textsc{ConstructHashTable}}{$\{x_i\}_{i\in[n]}\subset \mathbb{R}^d$}\label{lin:construct_hash_table}
    \For{$l\in [L]$}
        \For{$i\in [n]$}
            \State $\mathcal{H}_l(x_i)$.\textsc{Insert}($x_i$) \label{lin:find_bucket_insert_element}
            \State $\mathcal{T}_l\leftarrow \mathcal{T}_l\cup \mathcal{H}_l(x_i)$ \Comment{Creat hashtable by aggregating buckets} \label{lin:aggregate_hash_table}
        \EndFor
    \EndFor
\EndProcedure

\State {\bf end data structure}
\end{algorithmic}
\end{algorithm}

In this section, we present the initialize part of our data structure.
We start by analyzing the space storage for \textsc{LSH} and \textsc{DynamicKDE}. Then we state the result of running time for \textsc{LSH} in Lemma~\ref{lem:LSH_initialize_upper_bound} and \textsc{DynamicKDE} in Lemma~\ref{lem:dynamic_KDE_initialize}. 
We first show the space storage of LSH part in our data structure.

\begin{lemma}[Space storage of \textsc{LSH}]\label{lem:LSH_storage}
Let $ X = \{x_1, x_2, \dots, x_n\} \subset \mathbb{R}^d $ be a dataset of $ n $ points in $ d $-dimensional space. Consider an \textsc{LSH}-based data structure that constructs $ L $ independent hash tables using $ k $-wise concatenated hash functions from an \textsc{LSH} family $ \mathcal{H} $. 

The space required to initialize and store the \textsc{LSH} data structure, including hash functions and the constructed hashtables, is:
\begin{align*}
    O(Lkdn^{o(1)} + Ln).
\end{align*}

\begin{proof}
The space storage comes from two parts: \textsc{ChooseHashFunc} and \textsc{ConstructHashTable} in Algorithm~\ref{alg:LSH_private_app}.

{\bf Part 1.} \textsc{ChooseHashFunc} (line~\ref{lin:choose_hash_func}) takes $L,k$ as input. 
It has a for loop with $L$ iterations.
In each iteration, it samples $k$ functions (line~\ref{lin:LSH_sample_k_functions}) from hash family $\mathcal{H}$ to create $\mathcal{H}_l$, which uses $O(kdn^{o(1)})$ space.
Thus the total space usage of \textsc{ChooseHashFunc} is $L\cdot O(kdn^{o(1)})=O(Lkdn^{o(1)})$.

{\bf Part 2.} \textsc{ConstructHashTable} (line~\ref{lin:construct_hash_table}) takes data set $\{x_i\}_{i\in[n]}$ and parameter $L$ as input.
It has two recursive for loops.
\begin{itemize}
    \item The first for loop repeats $L$ iterations.
    \item The second for loop repeats $n$ iterations.
\end{itemize}
The space storage of the inner loop comes from line~\ref{lin:insert} and line~\ref{lin:aggregate_hash_table}, which is $O(1)$.
Thus the total space storage of \textsc{ConstructHashTable} is $L\cdot n\cdot O(1)=O(Ln)$.

The final space storage of \textsc{Initialize} is 
\begin{align*}
    {\bf Part 1}+{\bf Part 2}
    =&~O(Lkdn^{o(1)}+Ln).
\end{align*}
% Thus, we complete the proof.
\end{proof} 

\end{lemma}

Thus, the LSH data structure maintains subquadratic storage complexity while enabling efficient approximate nearest-neighbor search.

\begin{algorithm}[!ht]\caption{Dynamic KDE, members and initialize part, informal version of Algorithm~\ref{alg:dynamic_KDE_initialize}}\label{alg:dynamic_KDE_initialize_pseudo}
\begin{algorithmic}[1]
\State {\bf data structure} \textsc{DynamicKDE}      \Comment{Theorem~\ref{thm:main_result}}
% \State
\State {\bf members} 
    \State \hspace{4mm} For ${i\in[n]}$,$x_i \in \mathbb{R}^d$ \Comment{dataset $X$}
    \State \hspace{4mm} $K_1, R, K_2 \in \mathbb{N}_+$ \Comment{Number of repetitions} 
    \State \hspace{4mm} For $a \in [K_1]$, $r \in [R]$, $\mathcal{H}_{a,r} \in \textsc{LSH}$ \label{lin:instance_LSH} \Comment{Instances from \textsc{LSH} class}
\State {\bf end members}
% \State
\Procedure{\textsc{Initialize}}{$X \subset \R^d,\epsilon \in (0,1),f_{\mathsf{KDE}} \in [0,1]$} \Comment{Lemma~\ref{lem:init}}
    \State Initialize $K_1,R$ as in Section~\ref{app:data}
    \For{$a=1,2,\cdots,K_1$}
        \For{$r=1,2,\cdots,R$}
            \State Compute $K_{2,r}, k_r$ as in Section~\ref{app:data}
            \State $P_j\leftarrow$ sample each element in $X$ with probability $\min\{\frac{1}{2^r n f_{\mathsf{KDE}}}, 1\}$.
            \State $\mathcal{H}_{a,r}.\textsc{Initialize}(P_r,k_r,K_{2,r})$ 
        \EndFor
        \State $\tilde{P}_{a}\leftarrow$ sample elements from $X$, each one has sample probability $\frac{1}{n}$ \Comment{Store $\tilde{P}_a$}
    \EndFor
\EndProcedure
% \State
\State {\bf end data structure}
\end{algorithmic}
\end{algorithm}

Then, we can prove the total space storage of \textsc{DynamicKDE} structure in the following lemma.



\begin{lemma}[Space storage part of Theorem~\ref{thm:main_result}, informal version of Lemma~\ref{lem:init_formal}]\label{lem:init}
The \textsc{Initialize} of the data structure \textsc{DynamicKDE} (Algorithm~\ref{alg:dynamic_KDE_initialize_pseudo}) uses space
\begin{align*}
    & ~ O(\epsilon^{-2}(\frac{1}{f_{\mathsf{KDE}}})^{o(1)}\cdot\log(1 / f_{\mathsf{KDE}})\\
    \cdot & ~ \mathrm{cost}(K)  \cdot\log^2 n\cdot(\frac{1}{f_{\mathsf{KDE}}}+n^{o(1)}\cdot\log^2 n)) 
\end{align*}
\end{lemma}
\begin{proof}[Proof sketch]
The space usage of the \textsc{Initialize} procedure of \textsc{DynamicKDE} mainly comes from $K_1 \cdot R$ copies of the \textsc{LSH} data structure $\mathcal{H}$.
By Lemma~\ref{lem:LSH_storage}, the space usage of each $\mathcal{H}$ is:
\begin{align*}
    & ~ O(Lkdn^{o(1)} + Ln)\\
    = & ~ O(\cost(f)n^{o(1)} \cdot \log^3 n + \cost(f)(1/f_{\mathsf{KDE}}) \cdot \log n).
\end{align*}
Since there are $K_1 \cdot R$ copies of $\mathcal{H}$, the total space usage of \textsc{Initialize} is:
\begin{align*}
    & ~ K_1 \cdot R \cdot O(\cost(K)\log n \cdot (1/f_{\mathsf{KDE}} + \log n))\\
    = & ~ O(\epsilon^{-2}(1/f_{\mathsf{KDE}})^{o(1)} \cdot \log(1/f_{\mathsf{KDE}}) \cdot \cost(K) \\
    \cdot & ~ \log^2 n \cdot (1/f_{\mathsf{KDE}} + n^{o(1)} \cdot \log^2 n))
\end{align*}
which is by $K_1 = O(\epsilon^{-2}(1/f_{\mathsf{KDE}})^{o(1)} \cdot \log n)$ and $R = O(\log(1/f_{\mathsf{KDE}}))$.
\end{proof}


For the running time, we again start with the LSH part.

\begin{lemma}[Upper bound on running time of \textsc{Initialize} of the data-structure \textsc{LSH}, informal version of Lemma~\ref{lem:LSH_initialize_upper_bound_formal}]\label{lem:LSH_initialize_upper_bound}
Given input data points $\{x_i\}_{i\in[n]}\subset \mathbb{R}^d$, parameters $k,L\in \mathbb{N}_+$, LSH parameters $p_{\mathrm{near}},p_{\mathrm{far}} \in [0,1],c\in[1,\infty), r\in \R_+$ and kernel $K$, the \textsc{Initialize} of the data-structure \textsc{LSH} (Algorithm~\ref{alg:LSH_public_app}) runs in time
% \begin{align*}
   $O(L\cdot(kdn^{o(1)}+dn^{1+o(1)}+n\log n))$.
% \end{align*}
\end{lemma}

 

Having shown the running time of LSH, we now move on to prove the total running time of \textsc{Init} in our data structure by combining the above result in the LSH part.

\begin{lemma}[The initialize part of Theorem~\ref{thm:main_result}, informal version of Lemma~\ref{lem:dynamic_KDE_initialize_formal}]\label{lem:dynamic_KDE_initialize}
Given $(K:\mathbb{R}^d\times \mathbb{R}^d \rightarrow [0,1], X \subset\mathbb{R}^d, \epsilon \in (0,1),f_{\mathsf{KDE}}\in[0,1])$, the \textsc{Initialize} of the data-structure \textsc{DynamicKDE} (Algorithm~\ref{alg:dynamic_KDE_initialize_pseudo}) runs in Eq.~\eqref{eq:time_1} time.
% \begin{align*}
%     O(\epsilon^{-2}n^{1+o(1)}\cost(f)\cdot (\frac{1}{f_{\mathsf{KDE}}})^{o(1)}\log(1 / f_{\mathsf{KDE}}) \cdot \log^2 n)
% \end{align*}
\end{lemma}

\begin{proof}[Proof sketch]
The \textsc{Initialize} procedure of \textsc{DynamicKDE} has two nested for loops. The outer loop repeats $K_1 = O(\epsilon^{-2}\log(n)\cdot f_{\mathsf{KDE}}^{-o(1)})$ times and the inner loop repeats $R = O(\log 1/f_{\mathsf{KDE}})$ times.
The running time of each iteration of the inner loop consists of $O(\log(1/f_{\mathsf{KDE}}))$ time for lines~\ref{lin:c_ij}-line~\ref{lin:sample_prob}, $O(n)$ time for line~\ref{lin:sample}, and $O(n^{1+o(1)}\cost(K)\cdot\log^2 n)$ time for line~\ref{lin:LSH_initialize_in_KDE}, by Lemma~\ref{lem:LSH_initialize_upper_bound}.
Thus, the total time for each iteration of the inner loop is $O(n^{1+o(1)}\cost(K)\cdot\log^2 n)$.
Since the inner loop repeats $R$ times and the outer loop repeats $K_1$ times, the total running time is:
$K_1 \cdot R \cdot O(n^{1+o(1)}\cost(K)\cdot\log^2 n)$
$= O(\epsilon^{-2}n^{1+o(1)}\cost(f)\cdot(1/f_{\mathsf{KDE}})^{o(1)}\cdot\log(1/f_{\mathsf{KDE}})\cdot\log^3 n)$ (Eq.~\eqref{eq:time_1})
where the last step uses $K_1 = O(\epsilon^{-2}\cdot f_{\mathsf{KDE}}^{-o(1)}\cdot\log n)$ and $R = O(\log(1/f_{\mathsf{KDE}}))$.
\end{proof}


 

\begin{algorithm}[!ht]\caption{Dynamic KDE, update part, informal version of Algorithm~\ref{alg:dynamic_KDE_update}}\label{alg:dynamic_KDE_update_pseudo}
\begin{algorithmic}[1]
\State {\bf data structure} \textsc{DynamicKDE} \Comment{Theorem~\ref{thm:main_result}}
 \State
\Procedure{\textsc{Update}}{$v \in \R^d, f_{\mathsf{KDE}} \in [0,1], i \in [n]$} \Comment{Lemma~\ref{lem:dynamic_KDE_update}}
    \For{$a=1,2,\cdots,K_1$}
        \For{$r=1,2,\cdots,R$}
            \State Update the hashtables in $\mathcal{H}_{a,r}$ with $v$
        \EndFor
    \EndFor
    \State Replace $x_i$ with $v$
\EndProcedure
\State
\State {\bf end data structure}
\end{algorithmic}
\end{algorithm}


\subsection{Update Part of Data Structure}

We move to the update part of our data structure.
We show how to update the LSH data structure. Then we can extend the update procedure to \textsc{DynamicKDE} structure to prove Lemma~\ref{lem:dynamic_KDE_update}.

\begin{lemma}[Update time of LSH, informal version of Lemma~\ref{lem:hash_update_formal}]\label{lem:hash_update}
Given a data point $v \in\mathbb{R}^d$ 
and index $i\in[n]$, the \textsc{UpdateHashTable} of the data-structure \textsc{LSH} (Algorithm~\ref{alg:LSH_public_app}) runs in (expected) time
% \begin{align*}
   $O(n^{o(1)}\log(n)\cdot \cost(f))$.
% \end{align*}
 
\end{lemma}

\textsc{Update} in LSH structure is a key part in \textsc{Update} for \textsc{DynamicKDE}.
Next, we show the running time of \textsc{Update} for \textsc{DynamicKDE} by combining the above results.


\begin{lemma}[The update part of Theorem~\ref{thm:main_result}, informal version of Lemma~\ref{lem:dynamic_KDE_update_formal}]\label{lem:dynamic_KDE_update}
Given an update $v \in\R^d$, $i\in[n]$, the \textsc{Update} of the data-structure \textsc{DynamicKDE} (Algorithm~\ref{alg:dynamic_KDE_update_pseudo}) runs Eq.~\eqref{eq:time} time.
% \begin{align*}
%     O(\epsilon^{-2}n^{o(1)}\cost(f)\cdot (\frac{1}{f_{\mathsf{KDE}}})^{o(1)}\log(1 / f_{\mathsf{KDE}}) \cdot \log^3 n).
% \end{align*}
\end{lemma}

\begin{proof}[Proof sketch]
The \textsc{Update} procedure of \textsc{DynamicKDE} has two nested for loops. The outer loop repeats $K_1 = O(\epsilon^{-2}\log(n)\cdot f_{\mathsf{KDE}}^{-o(1)})$ times and the inner loop repeats $R = O(\log 1/f_{\mathsf{KDE}})$ times.
The running time of each iteration of the inner loop is dominated by line~\ref{update_hashtable_in_dynamic_KDE}, which takes $O(n^{o(1)}\log(n)\cdot\cost(f))$ time by Lemma~\ref{lem:hash_update}.
Since the inner loop repeats $R$ times and the outer loop repeats $K_1$ times, the total running time is:
$K_1 \cdot R \cdot O(n^{o(1)}\cost(f)\cdot\log^2 n)= O(\epsilon^{-2}n^{o(1)}\cost(f)\cdot(1/f_{\mathsf{KDE}})^{o(1)}\log(1/f_{\mathsf{KDE}})\cdot\log^3 n)$.
\end{proof}

\subsection{Query Part of Data Structure}

Finally, we come to the query part. The goal of this section is to prove Lemma~\ref{lem:dynamic_KDE_query}, which shows the running time of \textsc{Query} procedure for \textsc{DynamicKDE}.


\begin{algorithm}[!ht]\caption{Dynamic KDE, query part, informal version of Algorithm~\ref{alg:dynamic_KDE_query}}\label{alg:dynamic_KDE_query_pseudo}
\begin{algorithmic}[1]
\State {\bf data structure} \textsc{DynamicKDE} \Comment{Theorem~\ref{thm:main_result}}
\State
 
\Procedure{\textsc{Query}}{$q\in \mathbb{R}^d, \epsilon \in (0,1),f_{\mathsf{KDE}} \in [0,1]$} 
 
    \For{$a=1,2,\cdots,K_1$}
        \For{$r=1,2,\cdots,R$}
            \State Recover near neighbours of $q$ using $\mathcal{H}_{a,r}$
            \State Store them into $\mathcal{S}$
        \EndFor
        \For{$x_{i}\in \mathcal{S}$}  
            \State $w_{i}\leftarrow f(x_{i},q)$
            \If{  {$x_{i}\in L_{r}$ for some $r\in[R]$}}
                \State $p_i \gets \min\{\frac{1}{2^r n f_{\mathsf{KDE}}}, 1\}$
            \EndIf
        \EndFor
        \State $T_{a}\leftarrow\sum_{x_{i}\in\mathcal{S}}\frac{w_i}{p_i}$
    \EndFor
    \State \Return $\mathrm{Median}_{a \in K_1} \{T_{a}\}$
\EndProcedure
\State 
\State {\bf end data structure}
\end{algorithmic}
\end{algorithm}

 

The running time of \textsc{Query} procedure depends on two parts: the number of recovered points in each weight level and the recovery time of LSH.
We first show the expected number of recovered points. 

\begin{lemma}[Expected number of points in level sets, informal version of Lemma~\ref{lem:expect_number_points_level_set_formal}]\label{lem:expect_number_points_level_set}
Given a query $q\in\R^d$ and fix $r\in[R]$. For any $i\in[R+1]$, weight level $L_i$ contributes at most $1$ point to the hash bucket of query $q$.
\end{lemma}

 

Next, we show the running time for LSH to recover points.

\begin{lemma}[Running time for recovering points given a query, informal version of Lemma~\ref{lem:recover_point_from_q_formal}]\label{lem:recover_point_from_q}
Given a query $q\in\R^d$ and $L,R,k\in \mathbb{N}_+$, the \textsc{Recover} of the data-structure \textsc{LSH} runs in (expected) time
% \begin{align*}
    $O(Lkn^{o(1)}+LR)$.
% \end{align*}
\end{lemma}

Combining two lemmas above, we prove the total running time of \textsc{Query} in \textsc{DynamicKDE} structure.

\begin{lemma}[Query part of Theorem~\ref{thm:main_result}, informal version of Lemma~\ref{lem:dynamic_KDE_query_formal}]\label{lem:dynamic_KDE_query}
Given a query $q\in\R^d$, the \textsc{Query} of the data-structure \textsc{DynamicKDE} (Algorithm~\ref{alg:dynamic_KDE_query}) runs in Eq.~\eqref{eq:time} time.
% \begin{align*}
%     O(\epsilon^{-2}n^{o(1)}\log(1 / f_{\mathsf{KDE}})\cdot f_{\mathsf{KDE}}^{-o(1)}\cdot\mathrm{cost}(K)\log^3 n).
% \end{align*}
\end{lemma}




