2021 (modified: 05 Feb 2023)ICML 2021Readers: Everyone
Abstract:Recent work has considered natural variations of the {\em multi-armed bandit} problem, where the reward distribution of each arm is a special function of the time passed since its last pulling. In ...