PAC Lower Bounds and Efficient Algorithms for The Max \(K\)-Armed Bandit Problem

2016 (modified: 11 Nov 2022), ICML 2016
Abstract: We consider the Max K-Armed Bandit problem, where a learning agent is faced with several stochastic arms, each a source of i.i.d. rewards of unknown distribution. At each time step the agent choose...