2014 (modified: 11 Nov 2022) | ICML 2014 | Readers: Everyone
Abstract: In online learning, a player chooses actions to play and receives reward and feedback from the environment with the goal of maximizing her reward over time. In this paper, we propose the model of c...