2017 (modified: 08 Nov 2022) COLT 2017 Readers: Everyone
Abstract: We revisit the study of optimal regret rates in bandit combinatorial optimization, a fundamental framework for sequential decision making under uncertainty that abstracts numerous combinatorial pred...