StepTool: Enhancing Multi-Step Tool Usage in LLMs via Step-Grained Reinforcement Learning

Yuanqing Yu, Zhefan Wang, Weizhi Ma, Shuai Wang, Chuhan Wu, Zhiqiang Guo, Min Zhang

Published: 10 Nov 2025, Last Modified: 02 Feb 2026CrossrefEveryoneRevisionsCC BY-SA 4.0
Loading