BioMedReasoner: Towards Multi-Hop Reasoning using Path-based Relational Learning on Biomedical Knowledge Graphs

Published: 24 Sept 2025, Last Modified: 26 Dec 2025NeurIPS2025-AI4Science PosterEveryoneRevisionsBibTeXCC BY 4.0
Additional Submission Instructions: For the camera-ready version, please include the author names and affiliations, funding disclosures, and acknowledgements.
Track: Track 1: Original Research/Position/Education/Attention Track
Keywords: drug discovery, biomedical knowledge graph, link prediction, path interpretations, inflammatory bowel disease
TL;DR: We propose BioMedReasoner as a modular framework that completes biomedical KGs and extracts gene-to-IBD paths with biological processes and molecular functions. Supplying anonymized paths to LLMs enables interpretable reasoning and target assessment.
Abstract: Scientific discovery of novel drug assets requires rigorous reasoning over the complex interplay of biomedical entities. By integrating multi-modal entities, biomedical knowledge graphs (KGs) have emerged as powerful resources for tasks such as identifying targets and uncovering mechanisms of drug action. However, effective reasoning over those KGs is often limited by their incompleteness and noisy information, hindering reliable downstream tasks. To address this challenge, we propose BioMedReasoner, a modular reasoning framework that performs KG completion as a precursor to interpretable multi-hop reasoning. Our approach builds on Neural Bellman-Ford Networks (NBFNet) that formulates link prediction as a path-based relational learning problem, enabling interpretable multi-hop predictions for biomedical applications. We evaluated BioMedReasoner on PrimeKG, a publicly available large-scale multi-modal biomedical KG, and demonstrated its effectiveness in KG completion and downstream reasoning. As a proof of concept, we focus on genes associated with inflammatory bowel disease (IBD), including the JAK family, TNF-$\alpha$, IL17A, and IL17RA. By extracting paths connecting the genes to IBD and to their biological processes and molecular functions, and supplying these paths to a large language model (LLM) while withholding gene names, we show that interpretable graph paths enable biological reasoning and assessment of gene suitability as therapeutic targets. These results highlight the potential of path-based, interpretable biomedical KG reasoning to guide early-stage drug discovery.
Submission Number: 362
Loading