Variance-Reduced Conservative Policy Iteration

2023 (modified: 24 Apr 2023), ALT 2023, Readers: Everyone
Abstract: We study the sample complexity of reducing reinforcement learning to a sequence of empirical risk minimization problems over the policy space. Such reductions-based algorithms exhibit local converg...