Guarantees for Epsilon-Greedy Reinforcement Learning with Function ApproximationDownload PDFOpen Website

2022 (modified: 20 Dec 2022)ICML 2022Readers: Everyone
Abstract: Myopic exploration policies such as epsilon-greedy, softmax, or Gaussian noise fail to explore efficiently in some reinforcement learning tasks and yet, they perform well in many others. In fact, i...
0 Replies

Loading