Double Explore-then-Commit: Asymptotic Optimality and BeyondDownload PDFOpen Website

2021 (modified: 21 Dec 2021)COLT 2021Readers: Everyone
Abstract: We study the multi-armed bandit problem with subGaussian rewards. The explore-then-commit (ETC) strategy, which consists of an exploration phase followed by an exploitation phase, is one of the mos...
0 Replies

Loading