Keywords: causal‑graph discovery, large language models, program analysis, reinforcement fine‑tuning, knowledge extraction
TL;DR: We propose RECoRD, which fuses static program analysis with RFT‑tuned LLM agents to turn source code into accurate, interpretable causal graphs without human labels.
Abstract: Understanding the behavior and logical structure of complex algorithms is a fundamental challenge in industrial systems. Recent advances in large language models (LLMs) have demonstrated remarkable code-understanding capabilities; however, their potential for reverse engineering algorithms into interpretable causal structures remains unexplored. In this work, we develop RECoRD (Reverse Engineering Codebase to Relational Diagram), a multi-agent framework that leverages LLMs to turn source code into causal graphs. RECoRD uses reinforcement fine-tuning (RFT) to enhance the reasoning accuracy of its relation-extraction agent; fine-tuning on expert-curated causal graphs allows smaller, specialized models to outperform larger foundation models on domain-specific tasks. The RFT-trained models significantly outperformed their foundation counterparts, improving the F1 score from 0.69 to 0.97. RECoRD also exhibited strong generalization: models fine-tuned on one use case improved performance on others. By automating the construction of interpretable causal models from code, RECoRD has wide-ranging applications in areas such as software debugging, operational optimization, and risk management.
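The submission does not include code on this page; as an informal illustration only, the Python sketch below (all function names and the demo snippet are hypothetical, not taken from the paper) shows the two ingredients the abstract combines: a static-analysis pass that proposes candidate cause-effect edges from assignment statements, and an edge-level F1 score of the kind an RFT reward for the relation-extraction agent could be built on.

```python
import ast


def extract_candidate_edges(source: str) -> set[tuple[str, str]]:
    """Derive candidate (cause, effect) edges from Python assignments.

    For each assignment `y = f(x1, ..., xn)`, every variable read on the
    right-hand side is proposed as a candidate cause of every variable
    written on the left-hand side. A relation-extraction agent would then
    validate, label, or prune these candidates.
    """
    edges: set[tuple[str, str]] = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Assign):
            causes = {n.id for n in ast.walk(node.value)
                      if isinstance(n, ast.Name)}
            effects = {n.id for t in node.targets for n in ast.walk(t)
                       if isinstance(n, ast.Name)}
            edges |= {(c, e) for c in causes for e in effects}
    return edges


def edge_f1(predicted: set[tuple[str, str]],
            gold: set[tuple[str, str]]) -> float:
    """Edge-level F1 against an expert-curated graph -- the kind of
    training signal an RFT reward could optimize."""
    if not predicted or not gold:
        return 0.0
    tp = len(predicted & gold)
    if tp == 0:
        return 0.0
    precision = tp / len(predicted)
    recall = tp / len(gold)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    demo = """
price = base_cost * margin
discount = loyalty_score * 0.1
final_price = price * (1 - discount)
"""
    candidates = extract_candidate_edges(demo)
    gold = {("base_cost", "price"), ("margin", "price"),
            ("loyalty_score", "discount"),
            ("price", "final_price"), ("discount", "final_price")}
    print(f"edge F1 vs. gold graph: {edge_f1(candidates, gold):.2f}")
```

On this toy snippet the static pass already recovers the gold edges (F1 = 1.00); the paper's reported gains (0.69 to 0.97 F1) concern the harder relation-extraction step on real codebases, which this sketch only gestures at.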
Submission Number: 58