\documentclass{article}

\usepackage{aistats2024_author_response}

\usepackage[utf8]{inputenc} % allow utf-8 input
\usepackage[T1]{fontenc}    % use 8-bit T1 fonts
\usepackage{hyperref}       % hyperlinks
\usepackage{url}            % simple URL typesetting
\usepackage{booktabs}       % professional-quality tables
\usepackage{amsfonts}       % blackboard math symbols
\usepackage{nicefrac}       % compact symbols for 1/2, etc.
\usepackage{microtype}      % microtypography
\usepackage{xcolor}         % define colors in text
\usepackage{xspace}         % fix spacing around commands
\usepackage{setspace}

\newcommand\ehb[1]{\textcolor{blue}{[Blaser]: #1}}

\begin{document}
% \begin{spacing}{1.00}
We thank all reviewers for their constructive comments and suggestions, which will clearly make our paper stronger.

%%%Reviewer 1

\textbf{Response to R\#1.} 

\textbf{Q1:} Is a lower bound like Theorem 2 known in the synchronous setting? If yes, how difficult is it to adapt it to the asynchronous case?
\textbf{A1:} There is no existing communication lower bound for the synchronous federated pure exploration problem. Since the synchronous case is a special case of the asynchronous one, our bound is the first communication lower bound for the federated pure exploration problem.

\textbf{Q2:} 1) What is $T$ in Theorem 2? 2) Why was $\gamma = 1/M^{3/2}K$ used in Corollary 1? 3) What is the intuition behind ``local'' $\beta$ in eq. (6) depending on $M$?
\textbf{A2:} 1) We apologize for the typo: it should be $\tau$ instead of $T$. 2) We thank the reviewer for the sharp observation. $\gamma = 1/(2MK)$ is a better choice for balancing the trade-off between the communication cost and the sample complexity. 3) In the appendix, we show that under our communication protocol, $t \le (1 + \gamma M)\sum_{k=1}^K T_t^{ser}(k)$ (see Lemma 4). In the asynchronous federated pure exploration problem, the server and agents do not know the time index $t$, so they can only use the upper bound $(1 + \gamma M)\sum_{k=1}^K T_t^{ser}(k)$ to build the confidence radius (see Remark 2).

\textbf{Q3:} Would using rewards in real time to update arm pulls be better? Why is a low switching cost useful (Remark 1)?
\textbf{A3:} In Theorem 1 we show that the current update rule yields a sample complexity upper bound that matches the lower bound and is therefore optimal. Hence, using rewards in real time to update arm pulls cannot lead to a tighter bound, although it may perform better empirically. Details on low switching cost can be found in Abbasi-Yadkori et al. (2011) and He et al. (2022).

%%%Reviewer 2

\textbf{Response to R\#2.} 
% \textbf{Q1:} 
% The Theorem 2 is incorrect.

\textbf{Q1:} Correctness of Theorem 2
\textbf{A1:}
We would like to clarify a misunderstanding in R2's comments on our lower bound analysis. Specifically, the term $p_m$ should not appear on the RHS of Eq.~(41): by construction, \textbf{Aux} is a single-agent algorithm, so Lemma 6 applies. We also want to clarify that R2's argument of ``replacing \textbf{FedAlg} with an algorithm that just takes a constant number of samples'' is incorrect under the fixed-confidence setting studied in our paper, because the number of samples is out of the algorithm's control.

\textbf{Q2:} Not only does the algorithm need to send information about all $K$ arms in each round of communication, the number of communication rounds also scales with $K$, which is counter-intuitive.
\textbf{A2:} The dependence of $C(\tau)$ on $K$ arises because $\gamma$ is chosen to cancel out $K$ in the ratio $(1+\gamma M)/(1-\gamma K M)$ that appears in the sample complexity bound. We believe this dependence on $K$ could be reduced with a better algorithm design; nevertheless, our work is the first to study this problem.
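
As an illustrative check of this ratio (a worked substitution only, assuming the choice $\gamma = 1/(2MK)$ mentioned in A3 and $K \ge 1$): this gives $\gamma M = 1/(2K)$ and $\gamma K M = 1/2$, so
\[
\frac{1+\gamma M}{1-\gamma K M} = \frac{1 + 1/(2K)}{1/2} = 2 + \frac{1}{K} \le 3 ,
\]
i.e., under this assumption the ratio is bounded by a constant that does not depend on $K$ or $M$.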

\textbf{Q3:} 1) The choice of $\gamma$. 2) Advantage of having two trigger conditions in the linear bandits. %The choice of action is not affected by the condition on $T_{loc}$ and $T_{ser}$ so I am curious if there is any particular advantage of including that condition.
\textbf{A3:} We sincerely thank the reviewer for pointing out that $\gamma = 1/(2MK)$ yields a smaller communication cost upper bound (i.e., $\tilde{O}(MK)$). We will revise the result in the next version. We use two trigger conditions because their properties let us easily bound the global observation count and the global covariance matrix in terms of local data. Since the confidence radius in the linear case depends on the global covariance matrix and the observation count, this allows us to establish it directly.

%%%Reviewer 4

\textbf{Response to R\#4.} 

\textbf{Q1:} I don't understand why $\tau$ appears on the right side of its own bound. Also, the definition of $\tau (= \sum T^{ser} + \sum \sum T^{loc})$ should appear in the main part of the paper. And finally, using runtime sampling counting terms such as $T_m$ or $T^{ser}$ should not appear on the right side of the bound. The bound should only depend on the problem class parameters. Also, why is $C_m$ mentioned in the statement of Theorem 3?
\textbf{A1:} In Corollaries 1 and 2, $\tau$ appears only on the left side. Moreover, neither $T_m$ nor $T^{ser}$ appears on the right side of the upper bounds in Corollaries 1 and 2. We will add a clarification in the next version.


\textbf{Q2:} Code and additional sanity checks.
% \looseness=-1
\textbf{A2:} We will publish the code with the final version and provide additional experimental results following the reviewer's suggestion.

%%%Reviewer 6

\textbf{Response to R\#6.} 

\textbf{Q1:} Clarifying Theorem 1 and Theorem 3.
\textbf{A1:} In the upper bounds on $\tau$ in Corollaries 1 and 2, $\tau$ appears only on the LHS. We will clarify this in the final version.

\textbf{Q2:} 1) Lower bound for the linear case. 2) It is better to maintain uniformity in line styles and legends for the same algorithms across figures. 3) Code for reproducing the experimental results is missing.
\textbf{A2:} Thank you for the suggestions. We will follow them when revising the paper.

\end{document}
