SMART: Semantic Header Flattening and Pseudo-Code-Style Reasoning for LLM-based Complex Table Question Answering

SMART: Semantic Header Flattening and Pseudo-Code-Style Reasoning for LLM-based Complex Table Question Answering

ACL ARR 2026 January Submission7280 Authors

06 Jan 2026 (modified: 20 Mar 2026)ACL ARR 2026 January SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Table Question Answering, Table Reasoning, Table Representation

Abstract: Complex table question answering (TQA) remains challenging, as real-world table, usually designed for human readability with multi-level headers and fragmented hierarchical semantics, largely hindering large language models (LLMs) from accurately aligning conditions, attributes, and values during reasoning. Existing approaches typically rely on handcrafted table linearization or prompts, forcing LLMs to infer header hierarchies, which frequently leads to brittle reasoning and hallucinations. To this end, we propose SMART, a unified framework that explicitly decouples table structure understanding from reasoning execution. SMART consists of three components: Semantic Header Flattening for converting multi-level headers into explicit single-level descriptors, Global Understanding for capturing holistic table–question semantics, and Pseudo-Code-Style Reasoning for structured, step-by-step inference with external validation. Extensive experiments on multiple benchmarks demonstrate that SMART substantially improves both the accuracy and robustness of complex TQA, achieving state-of-the-art performance.

Paper Type: Long

Research Area: Question Answering

Research Area Keywords: table QA

Contribution Types: Model analysis & interpretability

Languages Studied: English

Submission Number: 7280

Loading