Toggle navigation
OpenReview
.net
Login
×
Back to
ICML
ICML 2023 Workshop MFPL Submissions
Who to imitate: Imitating desired behavior from diverse multi-agent datasets
Tim Franzmeyer
,
Jakob Nicolaus Foerster
,
Edith Elkind
,
Philip Torr
,
Joao F. Henriques
Published: 29 Jun 2023, Last Modified: 04 Oct 2023
MFPL Poster
Readers:
Everyone
Competing Bandits in Non-Stationary Matching Markets
Avishek Ghosh
,
Abishek Sankararaman
,
Kannan Ramchandran
,
Tara Javidi
,
Arya Mazumdar
Published: 29 Jun 2023, Last Modified: 04 Oct 2023
MFPL Poster
Readers:
Everyone
Strategic Apple Tasting
Keegan Harris
,
Chara Podimata
,
Steven Wu
Published: 29 Jun 2023, Last Modified: 04 Oct 2023
MFPL Poster
Readers:
Everyone
Strategyproof Decision-Making in Panel Data Settings and Beyond
Keegan Harris
,
Anish Agarwal
,
Chara Podimata
,
Steven Wu
Published: 29 Jun 2023, Last Modified: 04 Oct 2023
MFPL Poster
Readers:
Everyone
Provable Offline Reinforcement Learning with Human Feedback
Wenhao Zhan
,
Masatoshi Uehara
,
Nathan Kallus
,
Jason D. Lee
,
Wen Sun
Published: 29 Jun 2023, Last Modified: 04 Oct 2023
MFPL Poster
Readers:
Everyone
How to Query Human Feedback Efficiently in RL?
Wenhao Zhan
,
Masatoshi Uehara
,
Wen Sun
,
Jason D. Lee
Published: 29 Jun 2023, Last Modified: 04 Oct 2023
MFPL Oral
Readers:
Everyone
Contextual Bandits and Imitation Learning with Preference-Based Active Queries
Ayush Sekhari
,
Karthik Sridharan
,
Wen Sun
,
Runzhe Wu
Published: 29 Jun 2023, Last Modified: 04 Oct 2023
MFPL Poster
Readers:
Everyone
Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons
Banghua Zhu
,
Michael Jordan
,
Jiantao Jiao
Published: 29 Jun 2023, Last Modified: 04 Oct 2023
MFPL Oral
Readers:
Everyone
Inverse Game Theory for Stackelberg Games: the Blessing of Bounded Rationality
Jibang Wu
,
Weiran Shen
,
Fei Fang
,
Haifeng Xu
Published: 29 Jun 2023, Last Modified: 04 Oct 2023
MFPL Poster
Readers:
Everyone
Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards
Alexandre Rame
,
Guillaume Couairon
,
Corentin Dancette
,
Jean-Baptiste Gaya
,
Mustafa Shukor
,
Laure Soulier
,
Matthieu Cord
Published: 29 Jun 2023, Last Modified: 04 Oct 2023
MFPL Poster
Readers:
Everyone
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism
Zihao Li
Published: 29 Jun 2023, Last Modified: 04 Oct 2023
MFPL Poster
Readers:
Everyone
«
‹
1
2
3
›
»