2021 (modified: 04 Nov 2022), Manag. Sci. 2021, Readers: Everyone
Abstract: The contextual bandit literature has traditionally focused on algorithms that address the exploration–exploitation tradeoff. In particular, greedy algorithms that exploit current estimates without ...