Overview of the CLEF 2025 SimpleText Track

Liana Ermakova, Hosein Azarbonyad, Jan Bakker, Benjamin Vendeville, Jaap Kamps

Published: 01 Jan 2026, Last Modified: 05 Jan 2026CrossrefEveryoneRevisionsCC BY-SA 4.0
Abstract: Building on the success and insights from previous years, the CLEF 2025 SimpleText Track continues to advance the mission of making scientific information more accessible to a broader audience. In 2025, we introduced a new biomedical corpus, based on aligned Cochrane abstracts and plain language summaries, for the main scientific text simplification task. In addition, we devote particular attention to remaining issues of current generative models, focusing on indentifying and classifying overgeneration and other information distortion in the predictions, as well as promoting grounded text generation approaches. This paper presents an overview of the CLEF 2025 SimpleText Track. Task 1 focuses on Text Simplification, aiming to simplify complex scientific texts. Task 2 addresses Controlled Creativity, emphasizing the detection, classification, and avoidance of hallucinations in generated content. Task 3 revisits selected tasks from SimpleText 2024 by popular demand. We discuss the data and benchmarks provided for these tasks, along with preliminary insights and anticipated challenges.
Loading