SimRPD: Optimizing Recruitment Proactive Dialogue Agents through Simulator-Based Data Evaluation and Selection

Zhiyong Cao; Dunqiang Liu; Qi Dai; Haojun Xu; Huai Yuen Khor; HAO WANG; Huan He; Yafei Liu; Ke Ma; Ruqian Shi; Sicheng Zhou; Sijia Yao

SimRPD: Optimizing Recruitment Proactive Dialogue Agents through Simulator-Based Data Evaluation and Selection

Zhiyong Cao, Dunqiang Liu, Qi Dai, Haojun Xu, Huai Yuen Khor, HAO WANG, Huan He, Yafei Liu, Ke Ma, Ruqian Shi, Sicheng Zhou, Sijia Yao

Published: 18 Apr 2026, Last Modified: 26 Apr 2026ACL 2026 Industry Track OralEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Proactive Dialogue Systems, User Simulation, Data Selection, Chain-of-Intention

TL;DR: SimRPD optimizes recruitment dialogue agents by generating and rigorously filtering synthetic data via a novel Chain-of-Intention evaluation framework.

Abstract: Task-oriented proactive dialogue agents play a pivotal role in recruitment, particularly for steering conversations towards specific business outcomes, such as acquiring social-media contacts for private-channel conversion. Although supervised fine-tuning and reinforcement learning have proven effective for training such agents, their performance is heavily constrained by the scarcity of high-quality, goal-oriented domain-specific training data. To address this challenge, we propose $\textbf{SimRPD}$, a three-stage framework for training recruitment proactive dialogue agents. First, we develop a high-fidelity user simulator to synthesize large-scale conversational data through multi-turn online dialogue. Then we introduce a multi-dimensional evaluation framework based on $\textbf{Chain-of-Intention (CoI)}$ to comprehensively assess the simulator and effectively select high-quality data, incorporating both global-level and instance-level metrics. Finally, we train the recruitment proactive dialogue agent on the selected dataset. Experiments in a real-world recruitment scenario demonstrate that SimRPD outperforms existing simulator-based data selection strategies, highlighting its practical value for industrial deployment and its potential applicability to other business-oriented dialogue scenarios.

Submission Type: Deployed

Copyright Form: pdf

Submission Number: 323

Loading