PAC Lower Bounds and Efficient Algorithms for The Max \(K\)-Armed Bandit Problem

Yahel David, Nahum Shimkin

2016 (modified: 11 Nov 2022)ICML 2016Readers: Everyone

Abstract: We consider the Max K-Armed Bandit problem, where a learning agent is faced with several stochastic arms, each a source of i.i.d. rewards of unknown distribution. At each time step the agent choose...

0 Replies