Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit

Nikolai Karpov, Qin Zhang

2022 (modified: 23 Jan 2023) | AAAI 2022 | Readers: Everyone

Abstract: Motivated by real-world applications such as fast fashion retailing and online advertising, the Multinomial Logit Bandit (MNL-bandit) is a popular model in online learning and operations research, and has attracted much attention in the past decade. In this paper, we give efficient algorithms for pure exploration in the MNL-bandit. Our algorithms achieve instance-sensitive pull complexities. We also complement the upper bounds with an almost matching lower bound.
