Cheap Bandits

Manjesh Kumar Hanawal, Venkatesh Saligrama, Michal Valko, Rémi Munos

2015 (modified: 11 Nov 2022), ICML 2015

Abstract: We consider stochastic sequential learning problems where the learner can observe the average reward of several actions. Such a setting is interesting in many applications involving monitoring and ...
