ICML 2024 Workshop RLControlTheory Submissions
Reinforcement Learning with Quasi-Hyperbolic Discounting
Eshwar S R, Nibedita Roy, Gugan Thoppe
Published: 17 Jun 2024, Last Modified: 28 Jul 2024
FoRLaC Poster
Readers: Everyone
Non-Linear $H_\infty$ Robustness Guarantees for Neural Network Policies
Daniel Urieli
Published: 17 Jun 2024, Last Modified: 28 Jul 2024
FoRLaC Poster
Readers: Everyone
Distributional Monte-Carlo Planning with Thompson Sampling in Stochastic Environments
Tuan Quang Dam, Brahim Driss, Odalric-Ambrym Maillard
Published: 17 Jun 2024, Last Modified: 26 Jul 2024
FoRLaC Poster
Readers: Everyone
Learning When to Trust the Expert for Guided Exploration in RL
Felix Schulz, Jasper Hoffmann, Yuan Zhang, Joschka Boedecker
Published: 17 Jun 2024, Last Modified: 26 Jul 2024
FoRLaC Poster
Readers: Everyone
Bridging Distributional and Risk-Sensitive Reinforcement Learning: Balancing Statistical, Computational, and Risk Considerations
Hao Liang
Published: 17 Jun 2024, Last Modified: 25 Jul 2024
FoRLaC Poster
Readers: Everyone
A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of $\Theta(T^{2/3})$ and its Application to Best-of-Both-Worlds
Taira Tsuchiya, Shinji Ito
Published: 17 Jun 2024, Last Modified: 28 Jul 2024
FoRLaC Poster
Readers: Everyone
Safe online nonstochastic control from data
Sebastian Kerz, Armin Lederer, Marion Leibold, Dirk Wollherr
Published: 17 Jun 2024, Last Modified: 22 Jul 2024
FoRLaC Poster
Readers: Everyone
A Variational Formulation of Reinforcement Learning in Infinite-Horizon Markov Decision Processes
Tim G. J. Rudner
Published: 17 Jun 2024, Last Modified: 26 Jul 2024
FoRLaC Poster
Readers: Everyone
DARE: The Deep Adaptive Regulator for Control of Uncertain Continuous-Time Systems
Harrison Waldon, Fayçal Drissi, Yannick Limmer, Uljad Berdica, Jakob Nicolaus Foerster, Alvaro Cartea
Published: 17 Jun 2024, Last Modified: 17 Jun 2024
FoRLaC Poster
Readers: Everyone
Certifying robustness to adaptive data poisoning
Avinandan Bose, Madeleine Udell, Laurent Lessard, Maryam Fazel, Krishnamurthy Dj Dvijotham
Published: 17 Jun 2024, Last Modified: 27 Jul 2024
FoRLaC Poster
Readers: Everyone
Exploring Integrality Grip for Mixed-integer Programming by MCTS Planning
Defeng Liu
Published: 17 Jun 2024, Last Modified: 17 Jun 2024
FoRLaC Poster
Readers: Everyone
Power Mean Estimation in Stochastic Monte-Carlo Tree Search
Tuan Quang Dam, Odalric-Ambrym Maillard, Emilie Kaufmann
Published: 17 Jun 2024, Last Modified: 21 Jul 2024
FoRLaC Poster
Readers: Everyone
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
Cameron Allen, Aaron T. Kirtland, Ruo Yu Tao, Sam Lobel, Daniel Scott, Nicholas Petrocelli, Omer Gottesman, Ronald Parr, Michael Littman, George Konidaris
Published: 17 Jun 2024, Last Modified: 21 Jul 2024
FoRLaC Poster
Readers: Everyone
The Minimax Regret of Sequential Probability Assignment, Contextual Shtarkov Sums, and Contextual Normalized Maximum Likelihood
Ziyi Liu, Idan Attias, Daniel M. Roy
Published: 17 Jun 2024, Last Modified: 27 Jul 2024
FoRLaC Poster
Readers: Everyone
Multiple-policy Evaluation via Density Estimation
Yilei Chen, Aldo Pacchiano, Ioannis Paschalidis
Published: 17 Jun 2024, Last Modified: 27 Jul 2024
FoRLaC Poster
Readers: Everyone
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals
Ziyi Liu, Idan Attias, Daniel M. Roy
Published: 17 Jun 2024, Last Modified: 27 Jul 2024
FoRLaC Poster
Readers: Everyone
Pink Noise LQR: How does Colored Noise affect the Optimal Policy in RL?
Jakob Hollenstein, Marko Zaric, Samuele Tosatto, Justus Piater
Published: 17 Jun 2024, Last Modified: 01 Jul 2024
FoRLaC Poster
Readers: Everyone
Neural Dueling Bandits
Arun Verma, Zhongxiang Dai, Xiaoqiang Lin, Patrick Jaillet, Bryan Kian Hsiang Low
Published: 17 Jun 2024, Last Modified: 26 Jul 2024
FoRLaC Poster
Readers: Everyone
Safe Reinforcement Learning with Contrastive Risk Prediction
Hanping Zhang, Yuhong Guo
Published: 17 Jun 2024, Last Modified: 28 Jun 2024
FoRLaC Poster
Readers: Everyone
Optimistic Information Directed Sampling
Gergely Neu, Matteo Papini, Ludovic Schwartz
Published: 17 Jun 2024, Last Modified: 17 Jun 2024
FoRLaC Poster
Readers: Everyone
Bandits with Preference Feedback: A Stackelberg Game Perspective
Barna Pásztor, Parnian Kassraie, Andreas Krause
Published: 17 Jun 2024, Last Modified: 17 Jun 2024
FoRLaC Poster
Readers: Everyone
Model Based Diffusion for Trajectory Optimization
Chaoyi Pan, Zeji Yi, Guanya Shi, Guannan Qu
Published: 17 Jun 2024, Last Modified: 17 Jun 2024
FoRLaC Poster
Readers: Everyone
On PI Controllers for Updating Lagrange Multipliers in Constrained Optimization
Motahareh Sohrabi, Juan Ramirez, Tianyue H. Zhang, Simon Lacoste-Julien, Jose Gallego-Posada
Published: 17 Jun 2024, Last Modified: 05 Jul 2024
FoRLaC Poster
Readers: Everyone
Hybrid Recurrent Models Support Emergent Descriptions for Hierarchical Planning and Control
Poppy Collis, Ryan Singh, Paul Kinghorn, Christopher Buckley
Published: 17 Jun 2024, Last Modified: 22 Jul 2024
FoRLaC Poster
Readers: Everyone
When is Mean-Field Reinforcement Learning Tractable and Relevant?
Batuhan Yardim, Artur Goldman, Niao He
Published: 17 Jun 2024, Last Modified: 17 Jun 2024
FoRLaC Poster
Readers: Everyone