Toggle navigation
OpenReview
.net
Login
×
Back to
EWRL
EWRL 2024 Workshop Submissions
A Look at Value-Based Decision-Time vs. Background Planning Methods Across Different Settings
Safa Alver
,
Doina Precup
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
An Attentive Approach for Building Partial Reasoning Agents from Pixels
Safa Alver
,
Doina Precup
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Functional Acceleration for Policy Mirror Descent
Veronica Chelu
,
Doina Precup
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning
Yuhui Wang
,
Qingyuan Wu
,
Weida Li
,
Dylan R. Ashley
,
Francesco Faccio
,
Chao Huang
,
Jürgen Schmidhuber
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Robust Best-of-Both-Worlds Gap Estimators Based on Importance-Weighted Sampling
Sarah Clusiau
,
Saeed Masoudian
,
Yevgeny Seldin
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning
Max Weltevrede
,
Felix Kaubek
,
Matthijs T. J. Spaan
,
Wendelin Boehmer
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods
Sara Klein
,
Simon Weissmann
,
Leif Döring
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
A Distributional Analogue to the Successor Representation
Harley Wiltzer
,
Jesse Farebrother
,
Arthur Gretton
,
Yunhao Tang
,
Andre Barreto
,
Will Dabney
,
Marc G Bellemare
,
Mark Rowland
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Truly No-Regret Learning in Constrained MDPs
Adrian Müller
,
Pragnya Alatur
,
Volkan Cevher
,
Giorgia Ramponi
,
Niao He
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Evidence on the regularization properties of Maximum-Entropy Reinforcement Learning
Remy Hosseinkhan Boucher
,
Lionel Mathelin
,
Onofrio Semeraro
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Applying Reinforcement Learning to Navigation In Partially Observable Flows
Selim Mecanna
,
Aurore Loisy
,
Christophe Eloy
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning
Paul Daoudi
,
CHRISTOPHE PRIEUR
,
Bogdan Robu
,
Merwan Barlier
,
Ludovic Dos Santos
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
The Whys and Hows of Active Exploration in Model-Based Reinforcement Learning
Alberto Caron
,
Chris Hicks
,
Vasilios Mavroudis
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Learning mirror maps in policy mirror descent
Carlo Alfano
,
Sebastian Rene Towers
,
Silvia Sapora
,
Chris Lu
,
Patrick Rebeschini
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Generalisation to unseen topologies: Towards control of biological neural network activity
Laurens Engwegen
,
Daan Brinks
,
Wendelin Boehmer
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
RRLS : Robust Reinforcement Learning Suite
Adil Zouitine
,
David Bertoin
,
Pierre Clavier
,
Matthieu Geist
,
Emmanuel Rachelson
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity
Aditya Bhatt
,
Daniel Palenicek
,
Boris Belousov
,
Max Argus
,
Artemij Amiranashvili
,
Thomas Brox
,
Jan Peters
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Time-Constrained Robust MDPs
Adil Zouitine
,
David Bertoin
,
Pierre Clavier
,
Matthieu Geist
,
Emmanuel Rachelson
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning
Hector Kohler
,
Quentin Delfosse
,
Riad Akrour
,
Kristian Kersting
,
Philippe Preux
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Earth Observation Satellite Scheduling with Graph Neural Networks
Guillaume Infantes
,
Antoine Jacquet
,
Emmanuel Benazera
,
Stéphanie Roussel
,
Nicolas Meuleau
,
Vincent Baudoui
,
Jonathan Guerra
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation
Jean Seong Bjorn Choe
,
Jong-Kook Kim
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Deterministic Exploration via Stationary Bellman Error Maximization
Sebastian Griesbach
,
Carlo D'Eramo
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Adversarial Contextual Bandits Go Kernelized
Gergely Neu
,
Julia Olkhovskaya
,
Sattar Vakili
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Sum-Max Submodular Bandits
Stephen Pasteris
,
Alberto Rumi
,
Fabio Vitale
,
Nicolò Cesa-Bianchi
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
Dreaming of Many Worlds: Learning Contextual World Models aids Zero-Shot Generalization
Sai Prasanna
,
Karim Farid
,
Raghu Rajan
,
André Biedenkapp
Published: 01 Aug 2024, Last Modified: 09 Oct 2024
EWRL17
Readers:
Everyone
«
‹
1
2
3
4
5
›
»