Generic Exploration and K-armed Voting Bandits

2013 (modified: 11 Nov 2022), ICML (2) 2013, Readers: Everyone
Abstract: We study a stochastic online learning scheme with partial feedback where the utility of decisions is only observable through an estimation of the environment parameters. We propose a generic pure-e...