Toggle navigation
OpenReview
.net
Login
×
Back to
RLC
RLC 2024 Conference Submissions
Zero-shot cross-modal transfer of Reinforcement Learning policies through a Global Workspace
Léopold Maytié
,
Benjamin Devillers
,
Alexandre Arnold
,
Rufin VanRullen
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Offline Diversity Maximization under Imitation Constraints
Marin Vlastelica
,
Jin Cheng
,
Georg Martius
,
Pavel Kolev
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Policy Gradient Algorithms with Monte Carlo Tree Learning for Non-Markov Decision Processes
Tetsuro Morimura
,
Kazuhiro Ota
,
Kenshi Abe
,
Peinan Zhang
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Dreaming of Many Worlds: Learning Contextual World Models aids Zero-Shot Generalization
Sai Prasanna
,
Karim Farid
,
Raghu Rajan
,
André Biedenkapp
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
An Idiosyncrasy of Time-discretization in Reinforcement Learning
Kris De Asis
,
Richard S. Sutton
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach
Bin Hu
,
Chenyang Zhao
,
Pu Zhang
,
Zihao Zhou
,
Yuanhang Yang
,
Zenglin Xu
,
Bin Liu
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Tiered Reward: Designing Rewards for Specification and Fast Learning of Desired Behavior
Zhiyuan Zhou
,
Shreyas Sundara Raman
,
Henry Sowerby
,
Michael Littman
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
A Natural Extension To Online Algorithms For Hybrid RL With Limited Coverage
Kevin Tan
,
Ziping Xu
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis
Qining Zhang
,
Honghao Wei
,
Lei Ying
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling
Haque Ishfaq
,
Yixin Tan
,
Yu Yang
,
Qingfeng Lan
,
Jianfeng Lu
,
A. Rupam Mahmood
,
Doina Precup
,
Pan Xu
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
Yixuan Zhang
,
Qiaomin Xie
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Inverse Reinforcement Learning with Multiple Planning Horizons
Jiayu Yao
,
Weiwei Pan
,
Finale Doshi-Velez
,
Barbara E Engelhardt
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
On Welfare-Centric Fair Reinforcement Learning
Cyrus Cousins
,
Kavosh Asadi
,
Elita Lobo
,
Michael Littman
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning
Davide Corsi
,
Davide Camponogara
,
Alessandro Farinelli
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Mixture of Experts in a Mixture of RL settings
Timon Willi
,
Johan Samir Obando Ceron
,
Jakob Nicolaus Foerster
,
Gintare Karolina Dziugaite
,
Pablo Samuel Castro
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Value Internalization: Learning and Generalizing from Social Reward
Frieda Rong
,
Max Kleiman-Weiner
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Johan Samir Obando Ceron
,
João Guilherme Madeira Araújo
,
Aaron Courville
,
Pablo Samuel Castro
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Demystifying the Recency Heuristic in Temporal-Difference Learning
Brett Daley
,
Marlos C. Machado
,
Martha White
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
,
Claas A Voelcker
,
Igor Gilitschenski
,
Amir-massoud Farahmand
,
Eric Eaton
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning
Marcel Hussing
,
Jorge Mendez-Mendez
,
Anisha Singrodia
,
Cassandra Kent
,
Eric Eaton
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Exploring Uncertainty in Distributional Reinforcement Learning
Georgy Antonov
,
Peter Dayan
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Sequential Decision-Making for Inline Text Autocomplete
Rohan Chitnis
,
Shentao Yang
,
Alborz Geramifard
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors
Emma Cramer
,
Bernd Frauenknecht
,
Ramil Sabirov
,
Sebastian Trimpe
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
Multistep Inverse Is Not All You Need
Alexander Levine
,
Peter Stone
,
Amy Zhang
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
The Cliff of Overcommitment with Policy Gradient Step Sizes
Scott M. Jordan
,
Samuel Neumann
,
James E. Kostas
,
Adam White
,
Philip S. Thomas
Published: 15 May 2024, Last Modified: 14 Nov 2024
RLC 2024
Readers:
Everyone
«
‹
1
2
3
4
5
›
»