Keywords: Memory; Multi-hop reasoning; Large language model; Generalization
Abstract: Recent benchmarks suggest that large language models still have significant room for improvement in robustly reasoning over facts distributed across extremely long documents. In this work, we propose MemReasoner, a new memory-augmented LLM architecture trained to perform temporal reasoning, along with multiple computational steps, over the context stored in its memory. Experiments show that MemReasoner, trained on the core reasoning facts, generalizes better than off-the-shelf large language models and existing recurrent models on a test distribution where the required facts are scattered across long natural text of up to 128k tokens. Furthermore, MemReasoner reasons more robustly than the baselines when the answer distribution in test samples differs from that in the training set.
Submission Number: 66