Tight Bounds for Bandit Combinatorial OptimizationDownload PDFOpen Website

2017 (modified: 08 Nov 2022)COLT 2017Readers: Everyone
Abstract: We revisit the study of optimal regret rates in bandit combinatorial optimization—a fundamental framework for sequential decision making under uncertainty that abstracts numerous combinatorial pred...
0 Replies

Loading