Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning
Pan Lu, Liang Qiu, Kai-Wei Chang, Ying Nian Wu, Song-Chun Zhu, Tanmay Rajpurohit, Peter Clark, Ashwin Kalyan
Published: 01 Jan 2023, Last Modified: 13 May 2025
ICLR 2023
CC BY-SA 4.0