Towards Zero-Shot, Controllable Dialog Planning with LLMs

Dirk Väth; Thang Vu

Published: 13 Dec 2024, Last Modified: 02 Mar 2025, LM4Plan, Everyone, CC0 1.0

Keywords: NLP: Conversational AI/Dialog Systems, PRS: Planning with Language Models

TL;DR: This paper explores applying LLMs to dialog planning in a way that allows steering the conversation towards an overarching goal, while at the same time avoiding the hallucination problem and retaining an expert-controllable dialog flow.

Abstract: Recently, Large Language Models (LLMs) have emerged as an alternative to training task-specific dialog agents, due to their broad reasoning capabilities and performance in zero-shot learning scenarios. However, many LLM-based dialog systems fall short in planning towards an overarching dialog goal and therefore cannot steer the conversation appropriately. Furthermore, these models struggle with hallucination, making them unsuitable for information access in sensitive domains, such as the legal or medical domain, where the correctness of information given to users is critical. The recently introduced Conversational Tree Search (CTS) task proposes the use of dialog graphs to avoid hallucination in sensitive domains. However, state-of-the-art CTS agents are based on Reinforcement Learning (RL) and, although they excel at dialog strategy, require long training times. This paper introduces a novel zero-shot method for controllable CTS agents, where LLMs guide the dialog planning through domain graphs by searching and pruning relevant graph nodes based on user interaction preferences. We show that these agents significantly outperform state-of-the-art CTS agents ($p<0.0001$; Barnard's exact test) in simulation, and that this result generalizes to all available CTS domains. Finally, we perform a user evaluation to test the agent's performance in the wild, showing that our policy significantly ($p<0.05$; Barnard's exact test) improves task success compared to the state-of-the-art RL-based CTS agent.
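
The abstract does not spell out the algorithm, so the following is only an illustrative sketch of what "searching and pruning relevant graph nodes" could look like, not the authors' method. The toy graph, the function names `score` and `search_and_prune`, the `keep` parameter, and the keyword-overlap scorer standing in for an LLM relevance judgment are all assumptions made for illustration.

```python
from collections import deque

# Hypothetical toy dialog graph: node id -> (node text, list of child ids).
GRAPH = {
    "start": ("How can I help you with your trip?", ["visa", "reimbursement"]),
    "visa": ("Do you need a visa for your destination?", ["visa_eu", "visa_other"]),
    "visa_eu": ("No visa is required for travel within the EU.", []),
    "visa_other": ("Contact the travel office about visa requirements.", []),
    "reimbursement": ("Which expense do you want reimbursed?", ["hotel"]),
    "hotel": ("Hotel costs are reimbursed up to the daily limit.", []),
}


def score(query, text):
    """Assumed stand-in for an LLM relevance judgment: share of query words found in the text."""
    query_words = set(query.lower().split())
    text_words = set(text.lower().replace("?", "").replace(".", "").split())
    return len(query_words & text_words) / len(query_words) if query_words else 0.0


def search_and_prune(graph, root, query, keep=1):
    """Breadth-first search that follows only the `keep` best-scoring children of each node."""
    visited, frontier = [], deque([root])
    while frontier:
        node = frontier.popleft()
        visited.append(node)
        _, children = graph[node]
        # Rank children by relevance to the user query; prune everything below the top `keep`.
        ranked = sorted(children, key=lambda c: score(query, graph[c][0]), reverse=True)
        frontier.extend(ranked[:keep])
    return visited


path = search_and_prune(GRAPH, "start", "hotel reimbursed")
```

In this sketch, a larger `keep` explores more of the graph at the cost of more relevance judgments, while `keep=1` follows a single path. Because the agent only ever outputs text stored in graph nodes, the content shown to the user stays under expert control.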

Submission Number: 19
