2019 (modified: 09 Nov 2022)AISTATS2019Readers: Everyone
Abstract:We prove that two popular linear contextual bandit algorithms, OFUL and Thompson Sampling, can be made efficient using Frequent Directions, a deterministic online sketching technique. More precisel...