The Knowledge Gradient Algorithm for a General Class of Online Learning Problems

2012 (modified: 28 Sept 2022), Oper. Res. 2012, Readers: Everyone
Abstract: We derive a one-period look-ahead policy for finite- and infinite-horizon online optimal learning problems with Gaussian rewards. Our approach is able to handle the case where our prior beliefs abo...
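As a rough illustration of what a one-period look-ahead policy with Gaussian rewards can look like, the sketch below uses the simplest textbook setting: independent normal beliefs about each alternative and a known observation noise variance. The independence assumption, the function names (`kg_factors`, `online_kg_choice`, `update_belief`), and the `remaining` multiplier that weights the value of information by the number of decisions left are all assumptions made for this example. They are not taken from the abstract, and the paper's own formulation may differ.

```python
import math


def _pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def _cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def kg_factors(mu, var, noise_var):
    """One-step value of information for each alternative.

    Assumes independent normal beliefs N(mu[x], var[x]), known noise
    variance, and at least two alternatives (illustrative setting only).
    """
    factors = []
    for x in range(len(mu)):
        best_other = max(m for i, m in enumerate(mu) if i != x)
        # Standard deviation of the change in the posterior mean of x
        # caused by one more noisy observation of x.
        sigma_tilde = var[x] / math.sqrt(var[x] + noise_var)
        if sigma_tilde == 0.0:
            factors.append(0.0)
            continue
        z = -abs(mu[x] - best_other) / sigma_tilde
        factors.append(sigma_tilde * (z * _cdf(z) + _pdf(z)))
    return factors


def online_kg_choice(mu, var, noise_var, remaining):
    """Pick the alternative maximizing mean plus weighted information value.

    `remaining` is a hypothetical horizon weight: 0 reduces the rule to
    pure exploitation, larger values favor learning.
    """
    nu = kg_factors(mu, var, noise_var)
    scores = [m + remaining * v for m, v in zip(mu, nu)]
    return max(range(len(mu)), key=scores.__getitem__)


def update_belief(mu, var, noise_var, x, reward):
    """Conjugate normal update of alternative x after observing `reward`."""
    precision = 1.0 / var[x] + 1.0 / noise_var
    mu[x] = (mu[x] / var[x] + reward / noise_var) / precision
    var[x] = 1.0 / precision
```

With `remaining = 0` the rule simply picks the alternative with the highest current mean. As `remaining` grows, alternatives with high uncertainty and means close to the current best become more attractive, which is the exploration behavior a look-ahead policy is meant to capture.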