A Framework for Adapting Offline Algorithms to Solve Combinatorial Multi-Armed Bandit Problems with Bandit FeedbackDownload PDFOpen Website

Published: 2023, Last Modified: 11 Sept 2023ICML 2023Readers: Everyone
Abstract: We investigate the problem of stochastic, combinatorial multi-armed bandits where the learner only has access to bandit feedback and the reward function can be non-linear. We provide a general fram...
0 Replies

Loading