RusConText Benchmark: A Russian Language Evaluation Benchmark for Understanding Context

Andrey Chirkin; Svetlana Kuznetsova; Maria Volina; Anna Dengina

RusConText Benchmark: A Russian Language Evaluation Benchmark for Understanding Context

Andrey Chirkin, Svetlana Kuznetsova, Maria Volina, Anna Dengina

Published: 22 Jun 2025, Last Modified: 17 Jul 2025ACL-SRW 2025 PosterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: large language model, context understanding, benchmark

TL;DR: The paper presents new context understanding benchmark for Russian, encompassing ellipsis and coreference resolution, discourse and idiom understanding.

Abstract: This paper represents an implementation of an approach rather similar to that of (Zhu et al., 2024), adapted for the Russian-language data. We introduce the RusConText Benchmark for evaluating short-context understanding in Russian, comprising four distinct yet interrelated tasks: coreference resolution, discourse understanding, idiom interpretation and ellipsis resolution. Each task targets a specific aspect of linguistic processing, challenging a large language model to recover omitted information, resolve referential dependencies, interpret idioms and discourse. The RusConText Benchmark is an additional resource beyond standard benchmarks, designed to assess model performance from a specific perspective. In addition, we present the results of scoring 4 models on our benchmark.

Archival Status: Archival

Acl Copyright Transfer: pdf

Paper Length: Long Paper (up to 8 pages of content)

Submission Number: 343

Loading