The Knowledge Gradient Algorithm for a General Class of Online Learning Problems

Ilya O. Ryzhov, Warren B. Powell, Peter I. Frazier

2012 (modified: 28 Sept 2022) | Oper. Res. 2012 | Readers: Everyone

Abstract: We derive a one-period look-ahead policy for finite- and infinite-horizon online optimal learning problems with Gaussian rewards. Our approach is able to handle the case where our prior beliefs abo...
