Follow the Flow: Fine-grained Flowchart Attribution with Neurosymbolic Agents

Follow the Flow: Fine-grained Flowchart Attribution with Neurosymbolic Agents

ACL ARR 2025 February Submission6326 Authors

16 Feb 2025 (modified: 09 May 2025)ACL ARR 2025 February SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Abstract:

Flowcharts are a critical tool for visualizing decision-making processes, yet their nonlinear structure and complex visual-textual relationships make them challenging to interpret automatically. Vision language models frequently hallucinate non-existent connections and decision paths when analyzing these diagrams, compromising the reliability of automated flowchart processing. We introduce the task of Fine-grained Flowchart Attribution, a post-hoc strategy to mitigate visual flowchart hallucination by tracing specific components grounding a flowchart referring statement. Flowchart Attribution ensures the verifiability of model predictions and enhances explainability by linking generated responses to the flowchart’s structure. We introduce FlowExplainBench, a novel benchmark for evaluating flowchart attribution, encompassing diverse styles, domains, and question types while incorporating high-quality attribution annotations. We introduce FlowPathAgent, a neurosymbolic agent that performs fine-grained attribution through graph-based reasoning. It first segments the flowchart, then converts it into a structured symbolic graph, and then employs an agentic approach to dynamically interact with the graph to generate attribution paths. Experimental results show significant performance improvements, with FlowPathAgent outperforming existing methods by 6% to 65% on FlowExplainBench.

Paper Type: Long

Research Area: Multimodality and Language Grounding to Vision, Robotics and Beyond

Research Area Keywords: vision question answering; cross-modal information extraction; cross-modal content generation;

Contribution Types: Model analysis & interpretability, NLP engineering experiment, Publicly available software and/or pre-trained models, Data resources

Languages Studied: English

Submission Number: 6326

Loading