Toggle navigation
OpenReview
.net
Login
×
Back to
ICML
ICML 2024 Workshop ARLET Submissions
Reinforcement Learning in the Wild with Maximum Likelihood-based Model Transfer
Hannes Eriksson
,
Tommy Tram
,
Debabrota Basu
,
Mina Alibeigi
,
Christos Dimitrakakis
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
A Theoretical Framework for Partially-Observed Reward States in RLHF
Chinmaya Kausik
,
Mirco Mutti
,
Aldo Pacchiano
,
Ambuj Tewari
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity
Guhao Feng
,
Han Zhong
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Transductive Active Learning with Application to Safe Bayesian Optimization
Jonas Hübotter
,
Bhavya Sukhija
,
Lenart Treven
,
Yarden As
,
Andreas Krause
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Oral
Readers:
Everyone
Exploiting Exogenous Structure for Sample-Efficient Reinforcement Learning
Jia Wan
,
Sean R. Sinclair
,
Devavrat Shah
,
Martin J Wainwright
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning
Andreas Schlaginhaufen
,
Maryam Kamgarpour
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
How Does Return Distribution in Distributional Reinforcement Learning Help Optimization?
Ke Sun
,
Bei Jiang
,
Linglong Kong
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Offline RL via Feature-Occupancy Gradient Ascent
Gergely Neu
,
Nneka Okolo
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
,
Mateusz Ostaszewski
,
Marek Cygan
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Survive on Planet Pandora: Robust Cross-Domain RL Under Distinct State-Action Representations
Kuan-Chen Pan
,
MingHong Chen
,
Xi Liu
,
Ping-Chun Hsieh
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Improved Algorithms for Adversarial Bandits with Unbounded Losses
Mingyu Chen
,
Xuezhou Zhang
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors
Emma Cramer
,
Bernd Frauenknecht
,
Ramil Sabirov
,
Sebastian Trimpe
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
Junghyun Lee
,
Se-Young Yun
,
Kwang-Sung Jun
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Oral
Readers:
Everyone
Realtime Reinforcement Learning: Towards Rapid Asynchronous Deployment of Large Models
Matthew Riemer
,
Gopeshh Subbaraj
,
Glen Berseth
,
Irina Rish
Published: 19 Jun 2024, Last Modified: 03 Oct 2024
ARLET 2024 Poster
Readers:
Everyone
Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies
Alex DeWeese
,
Guannan Qu
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Chanwoo Park
,
Mingyang Liu
,
Dingwen Kong
,
Kaiqing Zhang
,
Asuman E. Ozdaglar
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Information Theoretic Guarantees For Policy Alignment In Large Language Models
Youssef Mroueh
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Oral
Readers:
Everyone
Delayed Adversarial Attacks on Stochastic Multi-Armed Bandits
Pierriccardo Olivieri
,
Matteo Castiglioni
,
Nicola Gatti
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Markov Persuasion Processes: How to Persuade Multiple Agents From Scratch
Francesco Bacchiocchi
,
Francesco Emanuele Stradi
,
Matteo Castiglioni
,
Nicola Gatti
,
Alberto Marchesi
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Adaptive Two-Level Quasi-Monte Carlo for Soft Actor-Critic
Du Ouyang
,
Zhenpeng Shi
,
Aodong Guo
,
Huaze Tang
,
Hejin Wang
,
Chao Wang
,
Wenbo Ding
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts
Onur Celik
,
Aleksandar Taranovic
,
Gerhard Neumann
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Partially Observable Multi-Agent Reinforcement Learning using Mean Field Control
Kai Cui
,
Sascha H. Hauck
,
Christian Fabian
,
Heinz Koeppl
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Oral
Readers:
Everyone
BenchMARL: Benchmarking Multi-Agent Reinforcement Learning
Matteo Bettini
,
Amanda Prorok
,
Vincent Moens
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
,
Marek Cygan
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Michal Nauman
,
Mateusz Ostaszewski
,
Krzysztof Jankowski
,
Piotr Miłoś
,
Marek Cygan
Published: 19 Jun 2024, Last Modified: 26 Jul 2024
ARLET 2024 Poster
Readers:
Everyone
«
‹
1
2
3
4
›
»