2013 (modified: 11 Nov 2022)ICML (2) 2013Readers: Everyone
Abstract:We study a stochastic online learning scheme with partial feedback where the utility of decisions is only observable through an estimation of the environment parameters. We propose a generic pure-e...