Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit FeedbackDownload PDFOpen Website

2019 (modified: 11 Nov 2022)ICML 2019Readers: Everyone
Abstract: We investigate the feasibility of learning from both fully-labeled supervised data and contextual bandit data. We specifically consider settings in which the underlying learning signal may be diffe...
0 Replies

Loading