Annotating Hallucinations in Data-to-Text NLG: Implementing a Logical Framework in Different Domains
Abstract: Hallucinations are a persistent challenge in natural language generation, including data-to-text generation. van Deemter (2024) introduced a unifying framework based on logical consequence, aiming to categorize all hallucinations through a single formal relation. We examine whether human annotators and large language models can apply the framework in two data-to-text domains. Results suggest that the framework is applicable, but they also reveal significant domain-dependent variation and discrepancies between human and model judgments. We also uncover several challenges that inform future work on hallucination annotation.
Paper Type: Long
Research Area: Generation
Research Area Keywords: data-to-text generation, domain adaptation, human evaluation, automatic evaluation
Languages Studied: English
Submission Number: 3177