2014 (modified: 25 Jan 2025)ACML 2014Readers: Everyone
Abstract:The problem we consider in this paper is reinforcement learning with value advice. In this setting, the agent is given limited access to an oracle that can tell it the expected return (value) of an...