Evaluating factual accuracy in complex data-to-text

Published: 01 Jan 2023, Last Modified: 18 Jun 2024
Venue: Comput. Speech Lang. 2023
License: CC BY-SA 4.0
Abstract: Highlights
• Factual accuracy problems limit the usefulness of neural solutions for complex data-to-text.
• Existing evaluation methods miss many of these errors, such as hallucination.
• We propose and evaluate a gold standard protocol for detecting factual errors in generated text.
• We show how this gold standard can be used to measure the efficacy of other methods.
• We also explore the common types of error in both human-authored and neural data-to-text systems.