OThink-SRR1: Search, Refine and Reasoning with Reinforced Learning for Large Language Models

Haijian Liang, Zenghao Niu, Junjie Wu, Changwang Zhang, Wangchunshu Zhou, Jun Wang

Published: 2026, Last Modified: 26 May 2026CoRR 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading