2022 (modified: 25 Jan 2023)COLT 2022Readers: Everyone
Abstract:In \emph{bandit with distribution shifts}, one aims to automatically adapt to unknown changes in reward distribution, and \emph{restart} exploration when necessary. While this problem has been stud...