2019 (modified: 11 Nov 2022), ICML 2019, Readers: Everyone
Abstract: We study Lipschitz bandits, where a learner repeatedly plays one arm from an infinite arm set and then receives a stochastic reward whose expectation is a Lipschitz function of the chosen arm. Most...
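
The interaction protocol described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the arm set [0, 1], the mean-reward function `mean_reward` (1-Lipschitz, peaked at 0.3), the Gaussian noise level, and the learner, which is a plain UCB1 rule over a fixed uniform grid of arms rather than the method the paper proposes.

```python
import math
import random


def mean_reward(x: float) -> float:
    """Hypothetical expected reward on [0, 1]; 1-Lipschitz, maximized at x = 0.3."""
    return 1.0 - abs(x - 0.3)


def pull(x: float, rng: random.Random, noise_sd: float = 0.1) -> float:
    """Stochastic reward: Lipschitz mean plus Gaussian noise (assumed noise model)."""
    return mean_reward(x) + rng.gauss(0.0, noise_sd)


def uniform_ucb(horizon: int, n_arms: int, rng: random.Random) -> float:
    """Run UCB1 on a fixed uniform grid of arms; return the most-played arm."""
    # Discretize the infinite arm set [0, 1] into n_arms cell midpoints.
    grid = [(i + 0.5) / n_arms for i in range(n_arms)]
    counts = [0] * n_arms
    sums = [0.0] * n_arms

    for t in range(1, horizon + 1):
        if t <= n_arms:
            # Play every grid arm once before using confidence bounds.
            i = t - 1
        else:
            # Pick the arm with the largest upper confidence bound.
            i = max(
                range(n_arms),
                key=lambda j: sums[j] / counts[j]
                + math.sqrt(2.0 * math.log(t) / counts[j]),
            )
        reward = pull(grid[i], rng)
        counts[i] += 1
        sums[i] += reward

    most_played = max(range(n_arms), key=lambda j: counts[j])
    return grid[most_played]


if __name__ == "__main__":
    arm = uniform_ucb(horizon=5000, n_arms=10, rng=random.Random(0))
    print(f"most-played arm: {arm:.2f}")
```

The fixed grid is only a baseline for making the infinite arm set tractable: the Lipschitz condition bounds how much reward is lost by restricting play to grid points, at the cost of spending pulls uniformly across regions that may be clearly suboptimal.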