\Cref{prop:totnst-ub} and \Cref{thm:upper-bd} imply that our regret bounds improve on those of \citet{dai2023quantum}
in the case of polynomial eigendecay, owing to the tradeoff parameter $\eta$.
We note that \Cref{alg:qmc-kernel-ucb} with $\eta=1$ is identical to Q-GP-UCB \citep{dai2023quantum},
and \Cref{prop:totnst-ub} suggests that the regret bound can be improved by setting $\eta$ to an appropriately small value.
In this section, we empirically verify this in a simple synthetic environment using the quantum simulator provided by the Qiskit library \citep{qiskit2024}.

In this environment, we define the quantum reward oracle $\cO_x$ as a quantum circuit representing a Bernoulli random variable
with mean $\mu(x) \in [0, 1]$; its implementation follows the qiskit-finance tutorial on amplitude estimation
(\url{https://qiskit-community.github.io/qiskit-finance/tutorials/00_amplitude_estimation.html}).
As the implementation of QMC, we use iterative amplitude estimation (IAE) \citep{grinko2021iterative} as implemented in the qiskit-algorithms library.
Here, as in \citet{dai2023quantum}, we use a theoretical upper bound on the number of oracle calls
rather than the actual number of oracle calls made by IAE.
Specifically, we set $T = 3000$ and $\cX = [0, 1]^d$ with $d = 1$, take $k$ to be the Mat\'ern-$\nu$ kernel with $\nu = 1.5$ and length-scale $0.3$,
and define $\mu$ as $x \mapsto k(x, x_0)$ with $x_0 = 0.2$.
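For concreteness, under the standard parameterization of the Mat\'ern kernel with $\nu = 1.5$ (an assumption here, with $\ell$ denoting the length-scale and the kernel normalized so that $k(x, x) = 1$), the reward function would take the closed form
\[
  \mu(x) = k(x, x_0)
  = \Bigl(1 + \frac{\sqrt{3}\,|x - x_0|}{\ell}\Bigr)
    \exp\Bigl(-\frac{\sqrt{3}\,|x - x_0|}{\ell}\Bigr),
  \qquad \ell = 0.3,\quad x_0 = 0.2 .
\]
Under this form, $\mu$ takes values in $(0, 1]$ and is maximized at $x = x_0$ with $\mu(x_0) = 1$, so it is a valid Bernoulli mean and the optimal point is $x_0$.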

Since our algorithm with $\eta=1$ is identical to Q-GP-UCB \citep{dai2023quantum},
we conduct an ablation study over the parameter $\eta$, which includes Q-GP-UCB as the special case $\eta=1$.
For the other parameters of \Cref{alg:qmc-kernel-ucb}, we set $\reg = 1.0$, $\delta/M = 10^{-2}$, and $S=1$.
We run \Cref{alg:qmc-kernel-ucb} in the synthetic environment 10 times
for each $\eta \in \{1, 10^{-1}, 10^{-2}, 10^{-3}\}$
and plot the cumulative regret in \Cref{fig:regret}, where the error bands represent 95\% confidence intervals of the cumulative regret.
The experimental results support our theoretical findings:
with an appropriately small value of the parameter $\eta$, \Cref{alg:qmc-kernel-ucb} achieves lower cumulative regret
than the existing method \citep{dai2023quantum}.
Moreover, the discussion in \Cref{sec:tradeoff-parameter} suggests that setting $\eta$ to a small value
decreases the total number of stages.
Indeed, in this experimental setting, the mean total number of stages
is $200.8$ (std $0.4$) for $\eta=1$ and
$18.6$ (std $0.49$) for $\eta = 10^{-2}$.

To improve empirical performance, we introduce an exploration parameter $v > 0$
into the UCB:
\begin{equation}
   \label{eq:app-ucb-v}
  \muw_{s-1}(x) + v\beta_{s-1} \sigmaw_{s-1} (x).
\end{equation}
We note that the case $v=1$ recovers \Cref{alg:qmc-kernel-ucb}.
We conducted experiments using the UCB \eqref{eq:app-ucb-v} for $v \in \{0.5, 0.1, 0.05\}$ under the same experimental setting
and show the cumulative regret in \Cref{fig:regret05,fig:regret01,fig:regret005}.
These experimental results also indicate that, with an appropriate choice of $\eta$,
our algorithm improves over Q-GP-UCB \citep{dai2023quantum}.


\begin{figure}[H]
   \begin{center}
        \includegraphics[width=0.7\linewidth]{images/regret.pdf}
   \end{center} 
   \caption{Ablation study of the parameter $\eta$. The case when $\eta=1$ is identical to the existing method \citep{dai2023quantum}.}
   \label{fig:regret}
\end{figure}

\begin{figure}[H]
   \begin{center}
        \includegraphics[width=0.7\linewidth]{images/regret_v0.5.pdf}
   \end{center} 
   \caption{Ablation study of the parameter $\eta$ with the exploration parameter $v=0.5$.}
   \label{fig:regret05}
\end{figure}


\begin{figure}[H]
   \begin{center}
        \includegraphics[width=0.7\linewidth]{images/regret_v0.1.pdf}
   \end{center} 
   \caption{Ablation study of the parameter $\eta$ with the exploration parameter $v=0.1$.}
   \label{fig:regret01}
\end{figure}


\begin{figure}[H]
   \begin{center}
        \includegraphics[width=0.7\linewidth]{images/regret_v0.05.pdf}
   \end{center} 
   \caption{Ablation study of the parameter $\eta$ with the exploration parameter $v=0.05$.}
   \label{fig:regret005}
\end{figure}