Published: 01 Jan 2020, Last Modified: 16 May 2023AISTATS 2020Readers: Everyone
Abstract:We consider a decentralized multi-agent Multi Armed Bandit (MAB) setup consisting of $N$ agents, solving the same MAB instance to minimize individual cumulative regret. In our model, agents collabo...