2020 (modified: 17 May 2023)ICML 2020Readers: Everyone
Abstract:We study reward maximisation in a wide class of structured stochastic multi-armed bandit problems, where the mean rewards of arms satisfy some given structural constraints, e.g. linear, unimodal, s...