Value-aware Importance Weighting for Off-policy Reinforcement LearningDownload PDFOpen Website

Published: 01 Jan 2023, Last Modified: 09 Mar 2024CoLLAs 2023Readers: Everyone
Abstract: Importance sampling is a central idea underlying off-policy prediction in reinforcement learning. It provides a strategy for re-weighting samples from a distribution to represent unbiased estimates...
0 Replies

Loading