Optimal Algorithms for Lipschitz Bandits with Heavy-tailed Rewards

Shiyin Lu, Guanghui Wang, Yao Hu, Lijun Zhang

2019 (modified: 11 Nov 2022) · ICML 2019 · Readers: Everyone

Abstract: We study Lipschitz bandits, where a learner repeatedly plays one arm from an infinite arm set and then receives a stochastic reward whose expectation is a Lipschitz function of the chosen arm. Most...
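The interaction protocol described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the arm set [0, 1], the hypothetical `mean_reward` function, the centred Pareto noise as one example of a heavy-tailed distribution, and the uniformly random placeholder learner. It shows only the setting, not the authors' algorithms.

```python
import random


def mean_reward(x: float) -> float:
    """Hypothetical expected reward on [0, 1]; 1-Lipschitz with respect to |x - x'|."""
    return 1.0 - abs(x - 0.3)


def pull(x: float, rng: random.Random) -> float:
    """Return a stochastic reward for arm x.

    The noise is a centred Pareto(alpha=1.5) draw: zero mean but infinite
    variance. It is an assumed example of heavy-tailed noise.
    """
    alpha = 1.5
    noise = rng.paretovariate(alpha) - alpha / (alpha - 1.0)
    return mean_reward(x) + noise


def run(rounds: int, seed: int = 0) -> float:
    """Play uniformly random arms (a placeholder learner); return the pseudo-regret."""
    rng = random.Random(seed)
    best = mean_reward(0.3)  # maximum of the assumed mean function
    regret = 0.0
    for _ in range(rounds):
        x = rng.random()          # learner picks one arm from the infinite arm set
        _reward = pull(x, rng)    # learner observes a noisy reward (unused here)
        regret += best - mean_reward(x)
    return regret
```

Calling `run(1000)` returns the cumulative pseudo-regret of the placeholder learner, which grows linearly in the number of rounds because the learner never adapts to the rewards it observes.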
