The Batch Complexity of Bandit Pure Exploration

Adrienne Tuynman; Rémy Degenne

The Batch Complexity of Bandit Pure Exploration

Adrienne Tuynman, Rémy Degenne

Published: 17 Jul 2025, Last Modified: 07 Oct 2025EWRL 2025 PosterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Bandits, Best arm identification, batched

Abstract: In a fixed-confidence pure exploration problem in stochastic multi-armed bandits, an algorithm iteratively samples arms and should stop as early as possible and return the correct answer to a query about the arms distributions. We are interested in batched methods, which change their sampling behaviour only a few times, between batches of observations. We give an instance-dependent lower bound on the number of batches used by any sample efficient algorithm for any pure exploration task. We then give a general batched algorithm and prove upper bounds on its expected sample complexity and batch complexity.

Confirmation: I understand that authors of each paper submitted to EWRL may be asked to review 2-3 other submissions to EWRL.

Serve As Reviewer: ~Adrienne_Tuynman1

Track: Fast Track: published work

Publication Link: adrienne.tuynman@inria.fr

Submission Number: 35

Loading