Orion-RAG: Path-Aligned Hybrid Retrieval for Graphless Data

ACL ARR 2026 January Submission5800 Authors

05 Jan 2026 (modified: 20 Mar 2026)ACL ARR 2026 January SubmissionEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Retrieval Augmented Language Models, Information Retrieval, Commercial, Tag, QA
Abstract: Retrieval-Augmented Generation (RAG) has proven effective for knowledge synthesis, yet it encounters significant challenges in practical scenarios where data is inherently discrete and fragmented. In most environments, information is distributed across isolated files like reports and logs that lack explicit links. Standard search engines process files independently, ignoring the connections between them. Furthermore, manually building Knowledge Graphs is impractical for such vast data. To bridge this gap, we present Orion-RAG. Our core insight is simple yet effective: we do not need heavy algorithms to organize this data. Instead, we use a low-complexity strategy to extract lightweight ``paths'' that naturally link related concepts. We demonstrate that this streamlined approach suffices to transform fragmented documents into semi-structured data, enabling the system to link information across different files effectively. Extensive experiments demonstrate that Orion-RAG consistently outperforms mainstream frameworks across diverse domains, supporting real-time updates and explicit Human-in-the-Loop verification with high cost-efficiency. Experiments on FinanceBench demonstrate superior precision with a 25.2\% relative improvement over strong baselines.
Paper Type: Long
Research Area: Information Extraction and Retrieval
Research Area Keywords: Retrieval-Augmented Generation (RAG), Financial NLP, Efficiency, Document Analysis
Contribution Types: NLP engineering experiment, Data resources
Languages Studied: English
Submission Number: 5800
Loading