Cheap Bandits

2015 (modified: 11 Nov 2022), ICML 2015, Readers: Everyone
Abstract: We consider stochastic sequential learning problems where the learner can observe the average reward of several actions. Such a setting is interesting in many applications involving monitoring and ...
