Published: 01 Jan 2021, Last Modified: 12 May 2023, ICML 2021, Readers: Everyone
Abstract: In this paper, we study the bandits with knapsacks (BwK) problem and develop a primal-dual based algorithm that achieves a problem-dependent logarithmic regret bound. The BwK problem extends the mu...