Deep Dense Exploration for LLM Reinforcement Learning via Pivot-Driven Resampling

Yiran Guo, Zhongjian Qiao, Yingqi Xie, Jie Liu, Dan Ye, Ruiqing Zhang, Shuang Qiu, Lijie Xu

Published: 2026, Last Modified: 24 Apr 2026CoRR 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading