2021 (modified: 21 Dec 2021)COLT 2021Readers: Everyone
Abstract:We study the multi-armed bandit problem with subGaussian rewards. The explore-then-commit (ETC) strategy, which consists of an exploration phase followed by an exploitation phase, is one of the mos...