\documentclass[twoside]{article}

\usepackage[accepted]{aistats2024}

\usepackage[utf8]{inputenc} % allow utf-8 input
\usepackage[T1]{fontenc}    % use 8-bit T1 fonts
\usepackage{hyperref}       % hyperlinks
\usepackage{url}            % simple URL typesetting
\usepackage{booktabs}       % professional-quality tables
\usepackage{amsfonts}       % blackboard math symbols
\usepackage{nicefrac}       % compact symbols for 1/2, etc.
\usepackage{microtype}      % microtypography
\usepackage{xcolor}         % colors

\usepackage{algorithm}
\usepackage{algorithmic}
\renewcommand{\algorithmiccomment}[1]{\hfill {\color{blue}$\triangleright$} #1}

\usepackage{natbib} % has a nice set of citation styles and commands
    \bibliographystyle{plainnat}
    \renewcommand{\bibsection}{\subsubsection*{References}}
\usepackage{mathtools} % amsmath with 

\usepackage{amsmath}
\usepackage{amssymb}
\usepackage{mathtools}
\usepackage{amsthm}
\usepackage{dsfont}

\usepackage{microtype}
\usepackage{graphicx}
\usepackage{subfigure}


\newcommand{\fix}{\marginpar{FIX}}
\newcommand{\new}{\marginpar{NEW}}

\newtheorem{assumption}{Assumption}
\newtheorem{example}{Example}
\newtheorem{definition}{Definition}
\newtheorem{remark}{Remark}
\newtheorem{theorem}{Theorem}
\newtheorem{corollary}{Corollary}
\newtheorem{lemma}{Lemma}
\newtheorem{claim}{Claim}
\newtheorem{proposition}{Proposition}
\newtheorem{regret}{Regret}
\newtheorem{gap}{Gap}


\def \bvarphi {\mathrm{\boldsymbol{\varphi}}}
\def \btheta {\bm \theta}
\def \bsigma {\bm \Sigma}
\def \mt {\mathsf{T}}
\def \bV {\displaystyle\mV}
\def \bx {\displaystyle\vx}
\def \bA {\displaystyle\sA}
\def \bC {\displaystyle\sC}
\def \bD {\displaystyle\sD}
\def \bR {\displaystyle\sR}
\def \bT {\mathcal{T}}
\def \bI {\bold{I}}
\def \bb {\displaystyle\vb}
\def \bD {\mathcal{D}}
\def \bR {\mathcal{R}}
\def \E {\mathcal{E}}
\def \C {\mathcal{C}}
\def \A {\mathcal{A}}
\def \bP {\mathcal{P}}
\def \bn {\displaystyle\vn}
\def \bc {\mathcal{C}}
\def \ba {\bold{a}}
\def \bone {\mathds{1}}
\def \bE {\mathds{E}}
\def \bB {\mathbb{B}}
\def \bN {\mathcal{N}}
\def \bbN {N}
\def \M {\mathcal{M}}
\def \x {\bold{x}}
\def \t {\boldsymbol{\theta}}
\def \R {\mathcal{R}}
\def \I {\mathcal{I}}
\def \y {\bold{y}}
\def \V {\bold{V}}
\def \b {\bold{b}}

\newcommand{\huazheng}[1]{\textcolor{blue}{[Huazheng: #1]}}
\newcommand{\chuanhao}[1]{\textcolor{blue}{[Li: #1]}}
\newcommand{\zichen}[1]{\textcolor{red}{[Zichen: #1]}}

\begin{document}

\twocolumn[

\aistatstitle{Pure Exploration in Asynchronous Federated  Bandits}

\aistatsauthor{ Zichen Wang\\ 
  Southwest University\\
  \texttt{swuzcw@gmail.com} \\
  \And
  Chuanhao Li\\
Yale University\\
  \texttt{chuanhao.li.cl2637@yale.edu}\\
  \And
  Chenyu Song\\
  Oregon State University\\
  \texttt{songchen@oregonstate.edu}
  \AND
  Lianghui Wang\\
  Oregon State University\\
  \texttt{wangl9@oregonstate.edu}\\
  \And
  Quanquan Gu\\
  UCLA\\
  \texttt{qgu@cs.ucla.edu}\\
  \And
  Huazheng Wang\\
  Oregon State University\\
\texttt{huazheng.wang@oregonstate.edu}
}

]

\begin{abstract}
We study the federated pure exploration problem of multi-armed bandits and linear bandits, where $M$ agents cooperatively identify the best arm via communicating with the central server. 
To enhance the robustness against latency and unavailability of agents that are common in practice, we propose the first federated asynchronous multi-armed bandit and linear bandit algorithms for pure exploration with fixed confidence. Our theoretical analysis shows the proposed algorithms achieve near-optimal sample complexities and efficient communication costs in a fully asynchronous environment. 
Moreover, experimental results based on synthetic and real-world data empirically elucidate the effectiveness and communication cost-efficiency of the proposed algorithms.
\end{abstract}

\input{Introduction}

\input{Related_works}

\input{Preliminary}

\input{Tabular}

\input{Linear}

\input{Experiment}

\section{CONCLUSION}
 In this paper, we propose the first study on the pure exploration problem of both federated MAB and federated linear bandits in an asynchronous environment. First, we proposed an algorithm named \texttt{FAMABPE}, which can complete the $(\epsilon,\delta)$-pure exploration object of the federated MAB with  $\tilde{O}(H^M_\epsilon)$ sample complexity and $\tilde{O}(MK)$ communication cost using a novel  event-triggered communication protocol. Then, we improved \texttt{FAMABPE} to \texttt{FALinPE}, which can finish the same object in the linear case with $\tilde{O}(H^L_\epsilon d)$ sample complexity and  $\tilde{O}\big(\max(M^{2}d,MK)\big)$ communication cost. At the end of the paper, the effectiveness of the offered algorithms was further examined by the numerical simulation based on synthetic data and real-world data. In our future work, a potential direction is to investigate federated asynchronous pure exploration algorithms with a fixed budget. 

\bibliography{Reference.bib}

\newpage

\input{Appendix}

\end{document}