Toggle navigation
OpenReview
.net
Login
×
Back to
RLC
RLC 2025 Conference Submissions
Learning to Explore in Diverse Reward Settings via Temporal-Difference-Error Maximization
Sebastian Griesbach
,
Carlo D'Eramo
Published: 09 May 2025, Last Modified: 02 Jun 2025
RLC 2025
Readers:
Everyone
Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning
Alihan Hüyük
,
Arndt Ryo Koblitz
,
Atefeh Mohajeri Moghaddam
,
Matthew Andrews
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Eau De $Q$-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning
Théo Vincent
,
Tim Faust
,
Yogesh Tripathi
,
Jan Peters
,
Carlo D'Eramo
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Multi-Task Reinforcement Learning Enables Parameter Scaling
Reginald McLean
,
Evangelos Chatzaroulas
,
J K Terry
,
Isaac Woungang
,
Nariman Farsad
,
Pablo Samuel Castro
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Fast Adaptation with Behavioral Foundation Models
Harshit Sikchi
,
Andrea Tirinzoni
,
Ahmed Touati
,
Yingchen Xu
,
Anssi Kanervisto
,
Scott Niekum
,
Amy Zhang
,
Alessandro Lazaric
,
Matteo Pirotta
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Impoola: The Power of Average Pooling for Image-based Deep Reinforcement Learning
Raphael Trumpp
,
Ansgar Schäfftlein
,
Mirco Theile
,
Marco Caccamo
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
A Finite-Sample Analysis of an Actor-Critic Algorithm for Mean-Variance Optimization in a Discounted MDP
Tejaram Sangadi
,
Prashanth L. A.
,
Krishna Jagannathan
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Recursive Reward Aggregation
Yuting Tang
,
Yivan Zhang
,
Johannes Ackermann
,
Yu-Jie Zhang
,
Soichiro Nishimori
,
Masashi Sugiyama
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Efficient Information Sharing for Training Decentralized Multi-Agent World Models
Xiaoling Zeng
,
Qi Zhang
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Quantitative Resilience Modeling for Autonomous Cyber Defense
Xavier Cadet
,
Simona Boboila
,
Edward Koh
,
Peter Chin
,
Alina Oprea
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Pure Exploration for Constrained Best Mixed Arm Identification with a Fixed Budget
Dengwang Tang
,
Rahul Jain
,
Ashutosh Nayyar
,
Pierluigi Nuzzo
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Towards Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models
Kefan Song
,
Jin Yao
,
Runnan Jiang
,
Rohan Chandra
,
Shangtong Zhang
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Drive Fast, Learn Faster: On-Board RL for High Performance Autonomous Racing
Benedict Hildisch
,
Edoardo Ghignone
,
Nicolas Baumann
,
Cheng Hu
,
Andrea Carron
,
Michele Magno
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
The Confusing Instance Principle for Online Linear Quadratic Control
Waris Radji
,
Odalric-Ambrym Maillard
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Hierarchical Multi-agent Reinforcement Learning for Cyber Network Defense
Aditya Vikram Singh
,
Ethan Rathbun
,
Emma Graham
,
Lisa Oakley
,
Simona Boboila
,
Peter Chin
,
Alina Oprea
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Non-Stationary Latent Auto-Regressive Bandits
Anna L. Trella
,
Walter H. Dempsey
,
Asim Gazi
,
Ziping Xu
,
Finale Doshi-Velez
,
Susan Murphy
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
AVID: Adapting Video Diffusion Models to World Models
Marc Rigter
,
Tarun Gupta
,
Agrin Hilmkil
,
Chao Ma
Published: 09 May 2025, Last Modified: 09 May 2025
RLC 2025
Readers:
Everyone
Reinforcement Learning from Human Feedback with High-Confidence Safety Guarantees
Yaswanth Chittepu
,
Blossom Metevier
,
Will Schwarzer
,
Scott Niekum
,
Philip S. Thomas
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
When and Why Hyperbolic Discounting Matters for Reinforcement Learning Interventions
Ian M. Moore
,
Eura Nofshin
,
Siddharth Swaroop
,
Susan Murphy
,
Finale Doshi-Velez
,
Weiwei Pan
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies
William Solow
,
Sandhya Saisubramanian
,
Alan Fern
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Pareto Optimal Learning from Preferences with Hidden Context
Ryan Bahlous-Boldi
,
Li Ding
,
Lee Spector
,
Scott Niekum
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
RL$^3$: Boosting Meta Reinforcement Learning via RL inside RL$^2$
Abhinav Bhatia
,
Samer B. Nashed
,
Shlomo Zilberstein
Published: 09 May 2025, Last Modified: 09 May 2025
RLC 2025
Readers:
Everyone
Uncertainty Prioritized Experience Replay
Rodrigo Antonio Carrasco-Davis
,
Sebastian Lee
,
Claudia Clopath
,
Will Dabney
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies
Jiaqi Chen
,
Ji Shi
,
Cansu Sancaktar
,
Jonas Frey
,
Georg Martius
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Improved Regret Bound for Safe Reinforcement Learning via Tighter Cost Pessimism and Reward Optimism
Kihyun Yu
,
Duksang Lee
,
William Overman
,
Dabeen Lee
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
«
‹
1
2
3
4
5
›
»