ATPO: Adaptive Tree Policy Optimization for Multi-Turn Medical Dialogue

Ruike Cao, Shaojie Bai, Fugen Yao, Liang Dong, Jian Xu, Li Xiao

Published: 2026, Last Modified: 25 May 2026CoRR 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading