Imitation-Regularized Offline Learning

Yifei Ma, Yu-Xiang Wang, Balakrishnan Narayanaswamy

Published: 2019, Last Modified: 12 May 2023AISTATS 2019Readers: Everyone

Abstract: We study the problem of offline learning in automated decision systems under the contextual bandits model. We are given logged historical data consisting of contexts, (randomized) actions, and (non...

0 Replies