Local Large Language Models for Recommendation

Yujin Jeon, Jooyoung Kim, Joonseok Lee

Published: 10 Nov 2025, Last Modified: 06 Dec 2025CrossrefEveryoneRevisionsCC BY-SA 4.0

Abstract: Unlike traditional classification tasks, recommendation is inherently subjective-whether an item should be suggested depends not only on user preferences and item semantics, but also on latent behavioral patterns and contextual cues. While recent LLM-based recommenders excel at modeling semantics and intent through generative reasoning, they often fail to capture collaborative signals and suffer from inefficiencies when applied globally across large interaction spaces. We propose Local Large Language Models for Recommendation(L3Rec), a novel model-agnostic framework that integrates collaborative filtering(CF) with generative LLMs through localized modeling. Our approach first applies a light-weight CF model to derive user and item embeddings, then clusters them into behaviorally coherent subgroups. Each cluster is assigned a dedicated generative LLM-referred to as a local LLM-trained only on its corresponding data subset. This enables fine-grained personalization while improving training efficiency through parallelism. At inference time, predictions from local models are aggregated via a fusion strategy, with a global CF fallback when needed. To the best of our knowledge, this is the first LLM-based recommendation framework to incorporate local collaborative structure. Experiments show that it achieves state-of-the-art performance with significantly better scalability and efficiency.

External IDs:doi:10.1145/3746252.3761280