Fixed Confidence Best Arm Identification in the Bayesian Setting

Kyoungseok Jang; Junpei Komiyama; Kazutoshi Yamazaki

Fixed Confidence Best Arm Identification in the Bayesian Setting

Kyoungseok Jang, Junpei Komiyama, Kazutoshi Yamazaki

Published: 25 Sept 2024, Last Modified: 06 Nov 2024NeurIPS 2024 posterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Multi-armed bandit, Best arm identification

TL;DR: Fixed-confidence best arm identification in the Bayesian setting, where the mean vector is drawn from a known prior.

Abstract: We consider the fixed-confidence best arm identification (FC-BAI) problem in the Bayesian setting. This problem aims to find the arm of the largest mean with a fixed confidence level when the bandit model has been sampled from the known prior. Most studies on the FC-BAI problem have been conducted in the frequentist setting, where the bandit model is predetermined before the game starts. We show that the traditional FC-BAI algorithms studied in the frequentist setting, such as track-and-stop and top-two algorithms, result in arbitrarily suboptimal performances in the Bayesian setting. We also obtain a lower bound of the expected number of samples in the Bayesian setting and introduce a variant of successive elimination that has a matching performance with the lower bound up to a logarithmic factor. Simulations verify the theoretical results.

Supplementary Material: zip

Primary Area: Bandits

Submission Number: 5619

Loading