This paper extends the quantum linear bandit problem \citep{wan2023quantum} to the kernelized case.
We consider the case where the eigenvalues of the Mercer operator decay polynomially or exponentially fast, and we provide an upper bound on the cumulative regret.
% For instance, squared exponential (SE) kernels and rational quadratic kernels (RQ) have $1/d$ exponential eigendecay,
For instance, Mat\'ern-$\nu$ kernels have a $1 + 2\nu/d$ polynomial eigendecay, 
and squared exponential (SE) kernels have a $1/d$ exponential eigendecay, if $\cX \subset \RR^d$.
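For concreteness, the following is a sketch of what these decay rates mean, assuming the standard convention of the classical analysis \citep{vakili2021information}. Writing $\lambda_1 \ge \lambda_2 \ge \cdots$ for the Mercer eigenvalues of $k$ (the symbols $\lambda_m$, $C_p$, $C_{e,1}$, and $C_{e,2}$ are notation assumed here for illustration), a $\beta_p$ polynomial eigendecay and a $\beta_e$ exponential eigendecay would read, respectively,
\[
  \lambda_m \le C_p \, m^{-\beta_p}
  \quad \mbox{and} \quad
  \lambda_m \le C_{e,1} \exp\left( - C_{e,2} \, m^{\beta_e} \right)
\]
for all $m \ge 1$, with constants $C_p, C_{e,1}, C_{e,2} > 0$, $\beta_p > 1$, and $\beta_e > 0$.
Under this reading, the Mat\'ern-$\nu$ example corresponds to $\beta_p = 1 + 2\nu/d$ and the SE example to $\beta_e = 1/d$.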
We show that the proposed algorithm achieves a regret bound of
$\widetilde{O}\left( T^{\frac{3}{1 + \beta_p}} \log\left(\frac{1}{\delta} \right)\right)$
if the kernel $k$ has a $\beta_p$ polynomial eigendecay,
and
$\widetilde{O} \left( \log^{3(1 + \beta_e^{-1})/2} (T) \log\left(\frac{1}{\delta} \right) \right)$
if the kernel $k$ has a $\beta_e$ exponential eigendecay.
% Here, $\widetilde{O}$ notation ignores factors of poly-logarithmic terms.
This result indicates that, under exponential eigendecay, the regret of the proposed method improves exponentially over that of classical algorithms \citep{valko2013finite,vakili2021information}.
We summarize the related studies in \Cref{tab:comp}.
We defer all omitted proofs to the Appendix.
% The proof technique is also used in the analysis of maximum information gain, and we derive the upper bound of the regret by following the technique.
% }