Toggle navigation
OpenReview
.net
Login
×
Go to
ALT 2023
homepage
Variance-Reduced Conservative Policy Iteration
Naman Agarwal
,
Brian Bullins
,
Karan Singh
2023 (modified: 24 Apr 2023)
ALT 2023
Readers:
Everyone
Abstract:
We study the sample complexity of reducing reinforcement learning to a sequence of empirical risk minimization problems over the policy space. Such reductions-based algorithms exhibit local converg...
0 Replies
Loading