Toggle navigation
OpenReview
.net
Login
×
Go to
AISTATS 2022
homepage
Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits
Wenshuo Guo
,
Kumar Krishna Agrawal
,
Aditya Grover
,
Vidya K. Muthukumar
,
Ashwin Pananjady
2022 (modified: 14 Apr 2023)
AISTATS 2022
Readers:
Everyone
0 Replies
Loading