Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?

Anonymous

16 Feb 2024 · ACL ARR 2024 February Blind Submission · Readers: Everyone
Abstract: Temporal reasoning is fundamental for large language models (LLMs) to comprehend the world. Existing temporal reasoning datasets are limited to questions about single or isolated events, and fail to capture the concurrency and intricate temporal interconnections that characterize real-world events. In this paper, we introduce CoTempQA, a comprehensive co-temporal Question Answering (QA) benchmark containing four co-temporal scenarios (Equal, Overlap, During, Mix) with 4,748 samples for evaluating the co-temporal comprehension and reasoning abilities of LLMs. Our extensive experiments reveal a significant gap between the performance of current LLMs and human-level reasoning on CoTempQA tasks. Even when enhanced with Chain-of-Thought (CoT) prompting, models consistently struggle with our task. In a preliminary exploration, we find that mathematical reasoning plays a significant role in handling co-temporal events, and we propose a strategy to boost LLMs' co-temporal reasoning from a mathematical perspective. We hope that our CoTempQA dataset will encourage further advances in the co-temporal reasoning capabilities of LLMs.
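For intuition, the four scenario names can be read as interval relations over the time spans during which two facts hold. The sketch below is a minimal illustration of that reading, assuming closed integer intervals (e.g., years); the Interval class and co_temporal_relation function are hypothetical and not taken from the paper, whose exact scenario definitions may differ.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Interval:
    """A closed time interval [start, end], e.g., the years a fact holds."""
    start: int
    end: int

def co_temporal_relation(a: Interval, b: Interval) -> str:
    """Classify how two fact intervals co-occur in time.

    Returns "Equal", "During", "Overlap", or "Disjoint", loosely
    mirroring the CoTempQA scenario names (illustrative only).
    """
    if a.end < b.start or b.end < a.start:
        return "Disjoint"   # no shared moment: the facts are not co-temporal
    if a.start == b.start and a.end == b.end:
        return "Equal"      # both facts hold over exactly the same span
    if (b.start <= a.start and a.end <= b.end) or \
       (a.start <= b.start and b.end <= a.end):
        return "During"     # one fact's span is contained in the other's
    return "Overlap"        # the spans share only a partial intersection

# Example: two roles held from 2010-2015 and 2012-2020 overlap partially,
# while a 2012-2015 tenure falls entirely within a 2010-2020 one.
print(co_temporal_relation(Interval(2010, 2015), Interval(2012, 2020)))  # Overlap
print(co_temporal_relation(Interval(2012, 2015), Interval(2010, 2020)))  # During
```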
Paper Type: long
Research Area: Question Answering
Contribution Types: Data resources
Languages Studied: English