2018 (modified: 11 Nov 2022), ICML 2018, Readers: Everyone
Abstract: In this paper, we propose a Minimax Concave Penalized Multi-Armed Bandit (MCP-Bandit) algorithm for a decision-maker facing high-dimensional data with latent sparse structure in an online learning ...