2015 (modified: 11 Nov 2022) | ICML 2015 | Readers: Everyone
Abstract: We consider stochastic sequential learning problems where the learner can observe the average reward of several actions. Such a setting is interesting in many applications involving monitoring and ...