Efficient Online Bandit Multiclass Learning with Õ(√T) RegretDownload PDFOpen Website

2017 (modified: 11 Nov 2022)ICML 2017Readers: Everyone
Abstract: We present an efficient second-order algorithm with $\tilde{O}(1/\eta \sqrt{T})$ regret for the bandit online multiclass problem. The regret bound holds simultaneously with respect to a family of l...
0 Replies

Loading