Published: 2019, Last Modified: 12 May 2023AISTATS 2019Readers: Everyone
Abstract:We study the problem of offline learning in automated decision systems under the contextual bandits model. We are given logged historical data consisting of contexts, (randomized) actions, and (non...