Published: 2021, Last Modified: 12 May 2023, Venue: UAI 2021, Readers: Everyone
Abstract: We study the problem of identifying the best arm in a stochastic multi-armed bandit game. Given a set of $n$ arms indexed from $1$ to $n$, each arm $i$ is associated with an unknown reward distribu...