OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Back to
the profile of Xiang Fan
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control
Xiang Fan
,
Yiwei Lyu
,
Paul Pu Liang
,
Ruslan Salakhutdinov
,
Louis-Philippe Morency
Published: 2023, Last Modified: 04 Oct 2023
ACL (Findings) 2023
Readers:
Everyone
0 Replies
Loading