Toggle navigation
OpenReview
.net
Login
×
Back to
RLC
RLC 2025 Conference Submissions
Achieving Limited Adaptivity for Multinomial Logistic Bandits
Sukruta Prakash Midigeshi
,
Tanmay Goyal
,
Gaurav Sinha
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
AI in a vat: Fundamental limits of efficient world modelling for safe agent sandboxing
Fernando Rosas
,
Alexander Boyd
,
Manuel Baltieri
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Thompson Sampling for Constrained Bandits
Rohan Deb
,
Mohammad Ghavamzadeh
,
Arindam Banerjee
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Empirical Bound Information-Directed Sampling
Piotr M. Suder
,
Eric Laber
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Representation Learning and Skill Discovery with Empowerment
Andrew Levy
,
Alessandro G Allievi
,
George Konidaris
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Learning Fair Pareto-Optimal Policies in Multi-Objective Reinforcement Learning
Umer Siddique
,
Peilang Li
,
Yongcan Cao
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Adaptive Submodular Policy Optimization
Branislav Kveton
,
Anup Rao
,
Viet Dac Lai
,
Nikos Vlassis
,
David Arbour
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Jake Grigsby
,
Yuqi Xie
,
Justin Sasek
,
Steven Zheng
,
Yuke Zhu
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Reinforcement Learning with Adaptive Temporal Discounting
Sahaj Singh Maini
,
Zoran Tiganj
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Shaping Laser Pulses with Reinforcement Learning
Francesco Capuano
,
Davorin Peceli
,
Gabriele Tiboni
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
PEnGUiN: Partially Equivariant Graph NeUral Networks for Sample Efficient MARL
Joshua McClellan
,
Greyson Brothers
,
Furong Huang
,
Pratap Tokekar
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
An Analysis of Action-Value Temporal-Difference Methods That Learn State Values
Brett Daley
,
Prabhat Nagarajan
,
Martha White
,
Marlos C. Machado
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Goals vs. Rewards: A Comparative Study of Objective Specification Mechanisms
Septia Rani
,
Serena Booth
,
Sarath Sreedharan
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Focused Skill Discovery: Using Per-Factor Empowerment to Control State Variables
Jonathan Colaço Carr
,
Qinyi Sun
,
Cameron Allen
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
On Slowly-varying Non-stationary Bandits
Ramakrishnan K
,
Aditya Gopalan
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Deep Reinforcement Learning with Gradient Eligibility Traces
Esraa Elelimy
,
Brett Daley
,
Andrew Patterson
,
Marlos C. Machado
,
Adam White
,
Martha White
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
A Timer-Enforced Hybrid Supervisor for Robust, Chatter-Free Policy Switching
Jan de Priester
,
Zachary I. Bell
,
Ricardo Sanfelice
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
One Goal, Many Challenges: Robust Preference Optimization Amid Content-Aware and Multi-Source Noise
Amirabbas Afzali
,
Amirhossein Afsharrad
,
Seyed Shahabeddin Mousavi
,
Sanjay Lall
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets
Alexander Levine
,
Peter Stone
,
Amy Zhang
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
V-Max: Making RL Practical for Autonomous Driving
Valentin Charraut
,
Thomas Tournaire
,
Waël Doulazmi
,
Thibault Buhet
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
AVG-DICE: Stationary Distribution Correction by Regression
Fengdi Che
,
Bryan Chan
,
Chen Ma
,
A. Rupam Mahmood
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
PAC Apprenticeship Learning with Bayesian Active Inverse Reinforcement Learning
Ondrej Bajgar
,
Dewi Sid William Gould
,
Jonathon Liu
,
Alessandro Abate
,
Konstantinos Gatsis
,
Michael A Osborne
Published: 09 May 2025, Last Modified: 06 Jun 2025
RLC 2025
Readers:
Everyone
Optimistic critics can empower small actors
Olya Mastikhina
,
Dhruv Sreenivas
,
Pablo Samuel Castro
Published: 09 May 2025, Last Modified: 04 Jun 2025
RLC 2025
Readers:
Everyone
Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners
Calarina Muslimani
,
Kerrick Johnstonbaugh
,
Suyog Chandramouli
,
Serena Booth
,
W. Bradley Knox
,
Matthew E. Taylor
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
TransAM: Transformer-Based Agent Modeling for Multi-Agent Systems via Local Trajectory Encoding
Conor Wallace
,
Umer Siddique
,
Yongcan Cao
Published: 09 May 2025, Last Modified: 28 May 2025
RLC 2025
Readers:
Everyone
«
‹
1
2
3
4
5
›
»