2018 (modified: 11 Nov 2022) | ICML 2018 | Readers: Everyone
Abstract: The bandit setting is a framework for designing sequential experiments in which, in each experiment, a learner selects an arm $A \in \mathcal{A}$ and obtains an observation corresponding to $A$. Theoretically, the...