Toggle navigation
OpenReview
.net
Login
×
×
BibTeX Record
Click anywhere on the box above to highlight complete record
Back to
RLC
RLC 2025 Conference Submissions
Offline Reinforcement Learning with Domain-Unlabeled Data
Soichiro Nishimori
,
Xin-Qiang Cai
,
Johannes Ackermann
,
Masashi Sugiyama
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Multi-task Representation Learning for Fixed Budget Pure-Exploration in Linear and Bilinear Bandits
Subhojyoti Mukherjee
,
Qiaomin Xie
,
Robert D Nowak
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
,
Josiah P. Hanna
,
Qiaomin Xie
,
Robert D Nowak
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting
Edoardo Cetin
,
Ahmed Touati
,
Yann Ollivier
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
ProtoCRL: Prototype-based Network for Continual Reinforcement Learning
Michela Proietti
,
Peter R. Wurman
,
Peter Stone
,
Roberto Capobianco
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Leveraging priors on distribution functions for multi-arm bandits
Sumit Vashishtha
,
Odalric-Ambrym Maillard
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain
,
Norio Kosaka
,
Xinhu Li
,
Kyung-Min Kim
,
Erdem Biyik
,
Joseph J Lim
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Understanding Learned Representations and Action Collapse in Visual Reinforcement Learning
Xi Chen
,
Zhihui Zhu
,
Andrew Perrault
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Efficient Morphology-Aware Policy Transfer to New Embodiments
Michael Przystupa
,
Hongyao Tang
,
Glen Berseth
,
Mariano Phielipp
,
Santiago Miret
,
Martin Jägersand
,
Matthew E. Taylor
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
MixUCB: Enhancing Safe Exploration in Contextual Bandits with Human Oversight
Jinyan Su
,
Wen Sun
,
Sarah Dean
,
Rohan Banerjee
,
Jiankai Sun
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Adaptive Reward Sharing to Enhance Learning in the Context of Multiagent Teams
Kyle Tilbury
,
David Radke
Published: 09 May 2025, Last Modified: 18 Jun 2025
RLC 2025
Readers:
Everyone
High-Confidence Policy Improvement from Human Feedback
Hon Tik Tse
,
Philip S. Thomas
,
Scott Niekum
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Rectifying Regression in Reinforcement Learning
Alex Ayoub
,
David Szepesvari
,
Alireza Bakhtiari
,
Csaba Szepesvari
,
Dale Schuurmans
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Benchmarking Partial Observability in Reinforcement Learning with a Suite of Memory-Improvable Domains
Ruo Yu Tao
,
Kaicheng Guo
,
Cameron Allen
,
George Konidaris
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Ömer Veysel Çağatan
,
Baris Akgun
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
PufferLib 2.0: Reinforcement Learning at 1M steps/s
Joseph Suarez
Published: 09 May 2025, Last Modified: 09 May 2025
RLC 2025
Readers:
Everyone
Reinforcement Learning for Human-AI Collaboration via Probabilistic Intent Inference
Yuxin Lin
,
Seyede Fatemeh Ghoreishi
,
Tian Lan
,
Mahdi Imani
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Make the Pertinent Salient: Task-Relevant Reconstruction for Visual Control with Distractions
Kyungmin Kim
,
JB Lanier
,
Roy Fox
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Optimal discounting for offline input-driven MDP
Randy Lefebvre
,
Audrey Durand
Published: 09 May 2025, Last Modified: 16 Jun 2025
RLC 2025
Readers:
Everyone
Benchmarking Massively Parallelized Multi-Task Reinforcement Learning for Robotics Tasks
Viraj Joshi
,
Zifan Xu
,
Bo Liu
,
Peter Stone
,
Amy Zhang
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
HANQ: Hypergradients, Asymmetry, and Normalization for Fast and Stable Deep $Q$-Learning
Braham Snyder
,
Chen-Yu Wei
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Long-Horizon Planning with Predictable Skills
Nico Gürtler
,
Georg Martius
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Mitigating Goal Misgeneralization via Minimax Regret
Karim Abdel Sadek
,
Matthew Farrugia-Roberts
,
Usman Anwar
,
Hannah Erlebach
,
Christian Schroeder de Witt
,
David Krueger
,
Michael D Dennis
Published: 09 May 2025, Last Modified: 11 Jun 2025
RLC 2025
Readers:
Everyone
DisDP: Robust Imitation Learning via Disentangled Diffusion Policies
Pankhuri Vanjani
,
Paul Mattes
,
Xiaogang Jia
,
Vedant Dave
,
Rudolf Lioutikov
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Nonparametric Policy Improvement in Continuous Action Spaces via Expert Demonstrations
Agustin Castellano
,
Sohrab Rezaei
,
Jared Markowitz
,
Enrique Mallada
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
«
‹
1
2
3
4
5
›
»