Toggle navigation
OpenReview
.net
Login
×
Back to
ICML
ICML 2024 Workshop ARLET Submissions
Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently
Sergio Calo
,
Anders Jonsson
,
Gergely Neu
,
Ludovic Schwartz
,
Javier Segovia-Aguas
Published: 19 Jun 2024, Last Modified: 06 Jan 2025
ARLET 2024 Poster
Readers:
Everyone
Should You Trust DQN?
Aditya Gopalan
,
Gugan Thoppe
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Reweighted Bellman Targets for Continual Reinforcement Learning
Ke Sun
,
Jun Jin
,
Xi Chen
,
Wulong Liu
,
Linglong Kong
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Accelerated Online Reinforcement Learning using Auxiliary Start State Distributions
Aman Mehra
,
Alexandre Capone
,
Jeff Schneider
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Handling Delay in Reinforcement Learning Caused by Parallel Computations of Neurons
Ivan Anokhin
,
Rishav Rishav
,
Stephen Chung
,
Irina Rish
,
Samira Ebrahimi Kahou
Published: 19 Jun 2024, Last Modified: 02 Aug 2024
ARLET 2024 Poster
Readers:
Everyone
Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error
Ally Yalei Du
,
Lin Yang
,
Ruosong Wang
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization
Chen Bo Calvin Zhang
,
Zhang-Wei Hong
,
Aldo Pacchiano
,
Pulkit Agrawal
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Decoupled Stochastic Gradient Descent for N-Player Games
Ali Zindari
,
Parham Yazdkhasti
,
Tatjana Chavdarova
,
Sebastian U Stich
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Risk-Aware Bandits for Best Crop Management
Dorian Baudry
,
Romain Gautron
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
A Tractable Inference Perspective of Offline RL
Xuejie Liu
,
Anji Liu
,
Guy Van den Broeck
,
Yitao Liang
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Quantized Representations Prevent Dimensional Collapse in Self-predictive RL
Aidan Scannell
,
Kalle Kujanpää
,
Yi Zhao
,
Mohammadreza Nakhaeinezhadfard
,
Arno Solin
,
Joni Pajarinen
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
EMPO: A Clustering-Based On-Policy Algorithm for Offline Reinforcement Learing
Jongeui Park
,
Myungsik Cho
,
Youngchul Sung
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Efficient Offline Reinforcement Learning: The Critic is Critical
Adam Jelley
,
Trevor McInroe
,
Sam Devlin
,
Amos Storkey
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Coordination Failure in Cooperative Offline MARL
Callum Rhys Tilbury
,
Juan Claude Formanek
,
Louise Beyers
,
Jonathan Phillip Shock
,
Arnu Pretorius
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Generalized Linear Bandits with Limited Adaptivity
Ayush Sawarni
,
Nirjhar Das
,
Siddharth Barman
,
Gaurav Sinha
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
The Importance of Online Data: Understanding Preference Fine-Tuning via Coverage
Yuda Song
,
Gokul Swamy
,
Aarti Singh
,
Drew Bagnell
,
Wen Sun
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Reward Centering
Abhishek Naik
,
Yi Wan
,
Manan Tomar
,
Richard S. Sutton
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Offline Reinforcement Learning with Pessimistic Value Priors
Filippo Valdettaro
,
Aldo A. Faisal
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Batched fixed-confidence pure exploration for bandits with switching constraints
Newton Mwai
,
Milad Malekipirbazari
,
Fredrik D. Johansson
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
KalMamba: Towards Efficient Probabilistic State Space Models for RL under Uncertainty
Philipp Becker
,
Niklas Freymuth
,
Gerhard Neumann
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer
Zhihan Liu
,
Miao Lu
,
Shenao Zhang
,
Boyi Liu
,
Hongyi Guo
,
Yingxiang Yang
,
Jose Blanchet
,
Zhaoran Wang
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Provable Partially Observable Reinforcement Learning with Privileged Information
Yang Cai
,
Xiangyu Liu
,
Argyris Oikonomou
,
Kaiqing Zhang
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm
Miao Lu
,
Han Zhong
,
Tong Zhang
,
Jose Blanchet
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Wind farm control with cooperative multi-agent reinforcement learning
Claire Bizon Monroc
,
Ana Busic
,
Jiamin Zhu
,
Donatien Dubuc
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Dual Approximation Policy Optimization
Zhihan Xiong
,
Maryam Fazel
,
Lin Xiao
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
«
‹
1
2
3
4
›
»