Meta-Thompson Sampling

2021 (modified: 12 Sept 2021), ICML 2021, Readers: Everyone
Abstract: Efficient exploration in bandits is a fundamental online learning problem. We propose a variant of Thompson sampling that learns to explore better as it interacts with bandit instances drawn from a...