ParetoMIL: Early Risk Detection in Dialogue under Weak Supervision

Published: 06 Oct 2025, Last Modified: 04 Nov 2025
Venue: MTI-LLM @ NeurIPS 2025 Poster
License: CC BY-ND 4.0
Keywords: multi-turn dialog, multi-instance learning, early classification of time series, risk detection
TL;DR: We propose a framework for early risk detection in multi-turn dialogue that achieves better earliness–accuracy trade-offs than strong baselines using only weak, dialogue-level supervision.
Abstract: Large Language Models (LLMs) increasingly operate in multi-turn interactions where the cost of failure grows with delay, creating a need for turn-level risk assessment and timely alerts. Existing approaches fall short: process reward modeling presumes step-wise labels; multi-instance learning (MIL) overlooks earliness; and early classification of time series (ECTS) neglects the complex relationship between turn-level events and dialogue-level risk. We propose a novel approach that integrates MIL and ECTS to deliver controllable early alerts from weak dialogue-level supervision. A soft-MIL scorer with prefix-conditioned encodings and monotone pooling produces a non-decreasing prefix risk, while a reinforcement-learning trigger, conditioned on a control parameter, balances earliness and accuracy with a single policy that traces the Pareto frontier without retraining. Empirically, our method improves the earliness–accuracy trade-off on multi-turn dialogues compared to strong baselines.
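To make the idea of a non-decreasing prefix risk concrete, here is a minimal sketch of two monotone pooling operators applied to per-turn risk scores. The abstract does not say which pooling operator the paper uses, so the noisy-OR and running-max choices below are assumptions made for illustration, as are all function names. The fixed-threshold trigger is a deliberately simple stand-in for the paper's reinforcement-learning trigger and is not the authors' method; the threshold only plays a role loosely analogous to the control parameter.

```python
from itertools import accumulate
from typing import List, Optional, Sequence


def _check_scores(turn_scores: Sequence[float]) -> None:
    # Per-turn scores are treated as probabilities (an assumption of this sketch).
    if any(p < 0.0 or p > 1.0 for p in turn_scores):
        raise ValueError("turn scores must lie in [0, 1]")


def prefix_risk_noisy_or(turn_scores: Sequence[float]) -> List[float]:
    """Noisy-OR pooling: r_t = 1 - prod_{i<=t} (1 - p_i).

    Each factor (1 - p_i) is in [0, 1], so the running product can only
    shrink and r_t can never decrease as more turns arrive.
    """
    _check_scores(turn_scores)
    survival = accumulate((1.0 - p for p in turn_scores), lambda a, b: a * b)
    return [1.0 - s for s in survival]


def prefix_risk_max(turn_scores: Sequence[float]) -> List[float]:
    """Running-max pooling: r_t = max_{i<=t} p_i, also non-decreasing."""
    _check_scores(turn_scores)
    return list(accumulate(turn_scores, max))


def first_alert_turn(prefix_risk: Sequence[float], threshold: float) -> Optional[int]:
    """Hypothetical trigger: 0-based index of the first turn whose prefix risk
    reaches the threshold, or None if no alert fires.

    A lower threshold alerts earlier at the cost of more false alarms.
    """
    for t, r in enumerate(prefix_risk):
        if r >= threshold:
            return t
    return None


if __name__ == "__main__":
    scores = [0.1, 0.05, 0.6, 0.2]  # made-up per-turn scores for one dialogue
    risk = prefix_risk_noisy_or(scores)
    print(risk)
    print(first_alert_turn(risk, threshold=0.5))
```

Because both pooled sequences are monotone, an alert that fires at some turn would also fire at every later turn, which is what makes "time of first alert" a well-defined measure of earliness in this toy setting.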
Submission Number: 225