In this work, we presented KoRe, an architecture for grounding Large Language Models in external Knowledge Graphs without the token overhead of textualization-based approaches. 
By combining a GNN encoder over 1-hop star subgraphs with a Directional Residual Vector Quantization scheme and a lightweight LoRA adaptation of a frozen Qwen3-8B backbone, KoRe compresses structured factual knowledge into $20$ discrete tokens per entity, reducing up to $10$× the used tokens compared to serializing the same graph as natural language.
We evaluated our models across three benchmarks and demonstrated that compact, discrete knowledge representations can effectively convey factual content to modern LLMs, achieving competitive or superior accuracy to textualization while dramatically reducing context bloating. 

