Toggle navigation
OpenReview
.net
Login
×
Go to
DBLP
homepage
Discovering a set of policies for the worst case reward
Tom Zahavy
,
André Barreto
,
Daniel J. Mankowitz
,
Shaobo Hou
,
Brendan O'Donoghue
,
Iurii Kemaev
,
Satinder Singh
Published: 01 Jan 2021, Last Modified: 25 Jan 2025
ICLR 2021
Everyone
Revisions
BibTeX
CC BY-SA 4.0
Loading