Toggle navigation
OpenReview
.net
Login
×
Back to
ICML
ICML 2024 Workshop RLControlTheory Submissions
Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage
Kishan Panaganti
,
Zaiyan Xu
,
Dileep Kalathil
,
Mohammad Ghavamzadeh
Published: 17 Jun 2024, Last Modified: 05 Jul 2024
FoRLaC Poster
Readers:
Everyone
Combining Neural Networks and Symbolic Regression for Analytical Lyapunov Function Discovery
Jie Feng
,
Haohan Zou
,
Yuanyuan Shi
Published: 17 Jun 2024, Last Modified: 12 Jul 2024
FoRLaC Poster
Readers:
Everyone
Randomized Confidence Bounds for Stochastic Partial Monitoring
Maxime Heuillet
,
Ola Ahmad
,
Audrey Durand
Published: 17 Jun 2024, Last Modified: 17 Jun 2024
FoRLaC Poster
Readers:
Everyone
Recurrent Natural Policy Gradient for POMDPs
Semih Cayci
,
Atilla Eryilmaz
Published: 17 Jun 2024, Last Modified: 23 Jul 2024
FoRLaC Poster
Readers:
Everyone
Online Optimization of Closed-Loop Control Systems
Hao Ma
,
Melanie Zeilinger
,
Michael Muehlebach
Published: 17 Jun 2024, Last Modified: 01 Jul 2024
FoRLaC Poster
Readers:
Everyone
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace
,
Bernhard Schölkopf
,
Gunnar Ratsch
,
Giorgia Ramponi
Published: 17 Jun 2024, Last Modified: 17 Jun 2024
FoRLaC Poster
Readers:
Everyone
$\alpha$-Fair Contextual Bandits
Siddhant Chaudhary
,
Abhishek Sinha
Published: 17 Jun 2024, Last Modified: 04 Jul 2024
FoRLaC Poster
Readers:
Everyone
Online Performance Optimization of Nonlinear Systems: A Gray-Box Approach
Zhiyu He
,
Michael Muehlebach
,
Saverio Bolognani
,
Florian Dorfler
Published: 17 Jun 2024, Last Modified: 17 Jul 2024
FoRLaC Poster
Readers:
Everyone
Chained Information-Theoretic Bounds and Tight Regret Rate for Linear Bandit Problems
Amaury Gouverneur
,
Borja Rodríguez Gálvez
,
Tobias Oechtering
,
Mikael Skoglund
Published: 17 Jun 2024, Last Modified: 17 Jun 2024
FoRLaC Poster
Readers:
Everyone
A safe exploration approach to constrained Markov decision processes
Tingting Ni
,
Maryam Kamgarpour
Published: 17 Jun 2024, Last Modified: 22 Jul 2024
FoRLaC Poster
Readers:
Everyone
Finite-time convergence to an $\epsilon$-efficient Nash equilibrium in potential games
Anna Maria Maddux
,
Reda Ouhamma
,
Maryam Kamgarpour
Published: 17 Jun 2024, Last Modified: 22 Jul 2024
FoRLaC Poster
Readers:
Everyone
Learning Nash Equilibria in Zero-Sum Markov Games: A Single-Timescale Algorithm Under Weak Reachability
Reda Ouhamma
,
Maryam Kamgarpour
Published: 17 Jun 2024, Last Modified: 23 Jul 2024
FoRLaC Poster
Readers:
Everyone
A Policy Optimization Approach to the Solution of Unregularized Mean Field Games
Sihan Zeng
,
Sujay Bhatt
,
Alec Koppel
,
Sumitra Ganesh
Published: 17 Jun 2024, Last Modified: 27 Jun 2024
FoRLaC Poster
Readers:
Everyone
Optimality of Stationary Policies in Risk-averse Total-reward MDPs with EVaR
Xihong Su
,
Marek Petrik
,
Julien Grand-Clément
Published: 17 Jun 2024, Last Modified: 27 Jun 2024
FoRLaC Poster
Readers:
Everyone
Reinforcement Learning with Lookahead Information
Nadav Merlis
Published: 17 Jun 2024, Last Modified: 05 Jul 2024
FoRLaC Poster
Readers:
Everyone
The Value of Reward Lookahead in Reinforcement Learning
Nadav Merlis
,
Dorian Baudry
,
Vianney Perchet
Published: 17 Jun 2024, Last Modified: 05 Jul 2024
FoRLaC Poster
Readers:
Everyone
A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays
Saeed Masoudian
,
Julian Zimmert
,
Yevgeny Seldin
Published: 17 Jun 2024, Last Modified: 17 Jun 2024
FoRLaC Poster
Readers:
Everyone
NEORL: Efficient Exploration for Nonepisodic RL
Bhavya Sukhija
,
Lenart Treven
,
Florian Dorfler
,
Stelian Coros
,
Andreas Krause
Published: 17 Jun 2024, Last Modified: 17 Jun 2024
FoRLaC Poster
Readers:
Everyone
Reinforcement Learning of Adaptive Acquisition Policies for Inverse Problems
Gianluigi Silvestri
,
Fabio Valerio Massoli
,
Tribhuvanesh Orekondy
,
Afshin Abdi
,
Arash Behboodi
Published: 17 Jun 2024, Last Modified: 24 Jul 2024
FoRLaC Poster
Readers:
Everyone
Adaptive Experimental Design for Policy Learning: Contextual Best Arm Identification
Masahiro Kato
,
Kyohei Okumura
,
Takuya Ishihara
,
Toru Kitagawa
Published: 17 Jun 2024, Last Modified: 27 Jul 2024
FoRLaC Poster
Readers:
Everyone
Tight Bounds for Online Convex Optimization with Adversarial Constraints
Abhishek Sinha
,
Rahul Vaze
Published: 17 Jun 2024, Last Modified: 04 Jul 2024
FoRLaC Poster
Readers:
Everyone
CPeSFA: Empowering SFs for Policy Learning and Transfer in Continuous Action Spaces
Yining LI
,
Tianpei Yang
,
Wei Guo
,
Jianye HAO
,
YAN ZHENG
Published: 17 Jun 2024, Last Modified: 29 Jun 2024
FoRLaC Poster
Readers:
Everyone
«
‹
1
2
3
›
»