Imitation-Regularized Offline LearningDownload PDFOpen Website

Published: 2019, Last Modified: 12 May 2023AISTATS 2019Readers: Everyone
Abstract: We study the problem of offline learning in automated decision systems under the contextual bandits model. We are given logged historical data consisting of contexts, (randomized) actions, and (non...
0 Replies

Loading