%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%\documentclass[9pt,a4paper]{article} %define the document class
\documentclass{article}
%\usepackage{nips_sty}

\usepackage[utf8]{inputenc} % allow utf-8 input
\usepackage[T1]{fontenc}    % use 8-bit T1 fonts

\usepackage{algorithm2e, algorithmic}
\usepackage{amssymb,url,amsmath, relsize}
\usepackage{mathrsfs, enumerate}
\usepackage{graphicx,subfigure}


%using equations, math fonts
\usepackage{amsmath,bm,amsthm,amssymb}

%using contents tables, index
\usepackage{makeidx}

%using graphs
\usepackage{graphicx}

%for locating tables. graphs
\usepackage{float}

%using colors, links
\usepackage{color}
\usepackage{xcolor}
\usepackage[colorlinks,linkcolor=blue,anchorcolor=blue,citecolor=blue]{hyperref}

%data time related
\usepackage[level]{datetime}

%Citation style
\usepackage[authoryear]{natbib} %for author year style
%\usepackage{natbib} %normal style

\setlength{\parskip}{\baselineskip}%
\setlength{\parindent}{0pt}%

%all kinds of marks, signs
\usepackage{wasysym}
\usepackage{marvosym}

%If you want to specify the content size on the page
\usepackage{geometry}
\geometry{left=2.5cm,right=2.5cm,top=2.5cm,bottom=2.5cm}

%if you want to count pages 
\usepackage{lastpage}

%Page heads and foots
\usepackage{fancyhdr}
\pagestyle{fancy}
\fancyhf{}
\fancyhead[L]{\scriptsize\textit{Appendix}}
\fancyhead[R]{\scriptsize\textit{CobBO: Coordinate backoff Bayesian Optimization}}
\fancyfoot[R]{\scriptsize{page \thepage/\pageref{LastPage}}}
\fancyfoot[L]{\scriptsize\textit{ }}
\renewcommand{\headrulewidth}{0pt} %hide the line
\setlength{\headsep}{0.5cm} %set the separation

%support author block
\usepackage{authblk}
%Title and author
\title{supplementary material}
\author[1]{ author1}
\author[2]{ author2}
\affil[1]{affil 1}
\affil[2]{affil 2}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

%%%%% PDF Propertity Info
\hypersetup{pdfauthor={Author},%
            pdftitle={A document prepared with TeXpen},%
            pdfsubject={the subject},%
            pdfkeywords={one, two},%
            pdfproducer={LaTeX},%
            pdfcreator={LaTeX and TeXpen}
}

%%% Drawing a picture
%\usepackage[Gray]{SIunits}
%\usepackage{colortbl}
%\usepackage[dvipsnames,pdftex,fixpdftex]{xcolor}
\usepackage{tikz}
\usetikzlibrary{decorations.pathmorphing}

\begin{document}
%%make the title and author list and footnotes
%%\maketitle

\section*{Additional features of CobBO}
\textbf{Data filtering by K-means classification:}
Dealing with the cubic computation cost in queries~\cite{snoek2012}, instead of using the sophisticated approximated Gaussian process regression~\cite{candela2005,bui2017}, above some quantity of aggregated observations, e.g. $1000$, we leverage the K-means algorithm~\cite{macqueen1967some} for discarding clustered points.
Specifically, we only keep the point of maximal value within each cluster. 
 Intuitively, if two nearby points have close function values,  discarding the smaller one for a maximization problem seems innocuous. Sometimes, it could even be better, since Bayesian optimization assumes the function $f(x)$ to be smooth, from a reproducing kernel Hilbert space~\cite{bull2011}.
 

\textbf{Batch queries:} %\label{ss:batch}
Due to sampling subspaces,  CobBO can be easily paralleled in a batch mode.  
Specifically, we can sample multiple coordinate subspaces, each containing the latest observed pivot point $V_t$. 
Since the batch mode does not require synchronization, multiple concurrent subspaces may not necessarily use an identical $V_t$.
In principle, we can integrate other batch methods~\cite{turbo2019,desautels14,emile2013,javier2016,tarun2016,javad2010,desautels14,wilson2017reparameterization} with CobBO.

\section*{Experimental environment}%\vspace{-5mm}

All of the experiments in this paper are conducted on common PCs with Linux/MacOSX systems. No special computational equipments are required. 
% \vspace{-2mm}

\section*{Impacts of the key/auxiliary features on the performance}\vspace{-5mm}

CobBO is configured with a default setting, including a stopping rule for determining the number of 
consistent queries and the strategies to form coarse and refined trust regions on slow and fast time scales, respectively. 
In order to compare the impact of the configurations, we test the following combinations. 
\begin{itemize}\vspace{-3mm}%\setlength\itemsep{0em}
\item consistent query $\in \{\rm{stopping\;rule}, \;\rm{fixed\; constant}\; q_{\rm{max}}\}$ %with $q_{\rm{max}}$ being the maximum number of consistent queries
\item $S \in\{\rm{true},\rm{false}\}$, whether or not to employ coarse trust regions on a slow time scale
\item $F \in\{\rm{true},\rm{false}\}$, whether or not to employ refined trust regions on a fast time scale
\end{itemize}\vspace{-3mm}
%
The fixed constant $q_{\rm{max}}$ represents the maximum number of consistent queries that can be continuously imposed to the 
currently selected coordinate subspace. 
It is a tradeoff between exploiting the potentials of the current coordinate subspace and exploring other new subspaces. 
%Conceptually, more consistent queries exploit the potential of the coordinate subspace, at the risk of missing better solutions of other subspaces due to the limited total budget. 
%
When coarse trust regions are enabled (i.e., $S=\rm{true}$) on a slow time scale, 
the procedure will exploit a neighborhood of $V_{t}$ instead of the full domain.  
%
If refined trust regions are used (i.e., $F=\rm{true}$) on a fast time scale, the alternation between coarse and refined trust regions
can help Bayesian optimization to better exploit the selected regions centering at~$V_{t}$. 
This alternation can help to distribute the new queries on both the centering area as well as the boundary areas. 
%
%Coarse trust regions can be considered as a trade-off between the refined small trust regions and the original domain.  
We use extensive experiments to empirically test how these features could contribute to the performance of CobBO. 
\vspace{-2mm}



We apply CobBO on 30 dimensional synthetic functions (Ackley, Levy and Rastrigin) and the robot pushing problem 
using $6$ different settings,  as shown in the following table:

\begin{table}[h]
%\caption{Table Caption}
\label{tab:settings}
\begin{center}
\begin{tabular}{lcccccc}
\hline
         &  $\rm{CobBO}^{\ast}$  & $\rm{CobBO}^{1}$ & $\rm{CobBO}^{2}$ & $\rm{CobBO}^{3}$ & $\rm{CobBO}^{4}$ & $\rm{CobBO}^{5}$ \\ 
\hline
$q_{\rm{max}}$    & stopping rule      &  stopping rule         & stopping rule        & stopping rule       &  1       & 15 \\
$S$  &true  &  false    & true   &  false  & true    & true  \\
$F$  & true  &  false    & false  &  true   &  true   & true   \\
\hline
\end{tabular}
\end{center}
\end{table}
Note that $\rm{CobBO}^{\ast}$ is the default setting that we have used to generate the experimental results in the main part of this paper. 
Based on the previous setup, we assign a budget of $2,500$ evaluations to Ackley, Levy and Rastrigin, and $7,000$ evaluations 
to the robot pushing problem.
For each configuration, we plot 95\% confidence intervals by repeating 30 independent experiments for each problem.  
The tested value $q_{\rm{max}}$ is chosen to be $2$ for 2500 evaluations and $3$ for $7,000$ evaluations. 


\begin{figure}[hbt]
\begin{center}
\includegraphics[width=0.98\columnwidth,height=!]{app-synthetic-30}
\end{center}\vspace{-9mm}
\caption{Performance of different configurations on synthetic problems}
\label{fig:d30}
\end{figure}
\vspace{-2mm}

On these three synthetic problems, performance of the different configurations are close to each other,  as shown in Fig.~\ref{fig:d30}. 
It indicates that in these cases CobBO is not sensitive to these configurations. 
However, small differences still exist for the experiments. \vspace{-2mm}
%
%
%On average, CobBO(3), CobBO(4)  and CobBO(5) outperform the other two groups eventually on all three problems. 
%It implies that with fast trust regions CobBO tends to get better global performance.

%CobBO(1) performs better than CobBO(2) at early stages, while CobBO(2) eventually catches up with or even surpasses CobBO(1) . Similar patterns can also be found between CobBO(3) and CobBO(4).  With slow trust regions enabled, CobBO(2) and CobBO(4) are able to explore regions with finer granularity. This exploitation-inclined setting makes CobBO converge slowly in the early stage while achieve competitive solution qualities in the end. 
%
$\rm{CobBO}^{5}$ with a larger $q_{\rm{max}}$ value performs slightly worse than $\rm{CobBO}^{3}$ and  $\rm{CobBO}^{4}$, 
while better than  $\rm{CobBO}^{1}$ and  $\rm{CobBO}^{2}$. It implies that $q_{\rm{max}}$ and $F$ have stronger impacts on the performance than 
$S$ in these cases. 
%
With the fast trust region feature enabled ($F = \rm{true}$),   
$\rm{CobBO}^{3}$ encourages more exploitation within smaller neighborhoods around the current best solutions,  and 
consistently outperforms  $\rm{CobBO}^{1}$ and  $\rm{CobBO}^{2}$ on all three problems. \vspace{-2mm}


%It indicates that enabling the fast trust region feature, which encourages global exploration, contributes to better solution qualities in these cases.



 \begin{figure}[hbt]
\begin{center}
\includegraphics[width=0.6\columnwidth,height=!]{rpush}
\end{center}\vspace{-9mm}
\caption{Performance of different configurations on the robot pushing problem}
\label{fig:push}
\end{figure}
\vspace{-2mm}

On the robot pushing problem, as shown in Fig.~\ref{fig:push}, the results from the $6$ configurations are not significantly different to each other, too. 
Specifically, $\rm{CobBO}^3$  slightly outperforms the other settings on average, similar to the above experiments. 
$\rm{CobBO}^5$ performs badly, possibly due to its excessive exploitation in the current coordinate subspaces. 
Different from the above cases, $\rm{CobBO}^1$ and $\rm{CobBO}^2$ find better solutions than  $\rm{CobBO}^4$ and $\rm{CobBO}^5$ on average. 
It suggests that restricting the procedure within refined trust regions may have a negative impact on the performance in this case.
The default setting is configured with refined trust regions. Thus, it is not as good as  $\rm{CobBO}^1$ and $\rm{CobBO}^2$. 
It indicates that better adaptive algorithms can be designed to further improve the performance of CobBO. 




\section*{The small variance on the 200 dimensional Levy function }\vspace{-5mm}

Fig. 5 in the manuscript cannot clearly show the variances of the sample paths of CobBO on the 200 dimensional Levy function. 
We zoom in the details and provide two partial views, as shown in Fig.~\ref{fig:200d-levy}.

 \begin{figure}[hbt]
\begin{center}
\includegraphics[width=0.6\columnwidth,height=!]{figures/200d-large.png}
\end{center}\vspace{-9mm}
\caption{Zoom in to show the variances in Fig. 5}
\label{fig:200d-levy}
\end{figure}
\vspace{-2mm}


%\section*{Half-cheetah reinforcement learning benchmark}\vspace{-5mm}
%The average award of this experiment using CobBO is around $4000$, which is better than other tested Bayesian optimization methods. 
%However, it is still not as good as Augmented Random Search, which reports a reward 






\clearpage
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%% The bib style options
%\bibliographystyle{apa}%{ieeetr}%{plain}
%% The bib file
%\bibliography{somefile.bib}

\end{document}
%%%%%%%%%%%%% END of DOCUMENT %%%%%%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%