Catalog-Native LLM: Speaking Item-ID dialect with Less Entanglement for Recommendation

ICLR 2026 Conference Submission 6134 Authors

Published: 26 Jan 2026, Last Modified: 26 Jan 2026 · ICLR 2026 · CC BY 4.0
Keywords: Recommender Systems, Large Language Models, Mixture of Experts
TL;DR: IDIOMoE decouples item-ID and language processing inside an LLM via token-type MoE, reducing interference and improving recommendation quality at roughly the same compute.
Abstract: Collaborative filtering delivers predictive accuracy and efficiency, while Large Language Models (LLMs) enable expressive and generalizable reasoning; modern recommender systems must bring these strengths together. Growing user expectations, such as natural-language queries and transparent explanations, further underscore the need for a unified approach. Doing so, however, is nontrivial: collaborative signals are often token-efficient but semantically opaque, while LLMs are semantically rich but struggle to model implicit user preferences when trained only on textual inputs. This paper introduces the Item-ID + Natural-language Mixture-of-Experts Language Model (IDIOMoE), which treats item interaction histories as a native dialect within the language space, enabling collaborative signals to be understood in the same way as natural language. By splitting the feed-forward network of each block of a pretrained LLM into a separate text expert and an item expert with token-type gating, our method avoids destructive interference between the text and catalog modalities. IDIOMoE demonstrates strong recommendation performance across both public and proprietary datasets while preserving the text understanding of the pretrained model.
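
The abstract describes a token-type Mixture-of-Experts layer: the feed-forward network in each transformer block is split into a text expert and an item expert, and a hard gate routes each token by whether it is a natural-language token or an item-ID token. The sketch below is a minimal PyTorch illustration of that idea, not the authors' implementation; the class name `TokenTypeMoEFFN`, the GELU two-layer expert shape, and the way the item mask is derived are assumptions made only for this example.

```python
# Minimal sketch of a token-type gated dual-FFN block (illustrative, not the paper's code).
import torch
import torch.nn as nn


class TokenTypeMoEFFN(nn.Module):
    """Replaces a block's FFN with two experts (text vs. item-ID), hard-routed by token type."""

    def __init__(self, d_model: int, d_ff: int):
        super().__init__()
        # Assumed expert shape: standard two-layer FFN with GELU.
        self.text_expert = nn.Sequential(
            nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model)
        )
        self.item_expert = nn.Sequential(
            nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model)
        )

    def forward(self, hidden: torch.Tensor, is_item: torch.Tensor) -> torch.Tensor:
        # hidden: (batch, seq, d_model); is_item: (batch, seq) boolean mask.
        # For clarity both experts run on every token; a real implementation
        # would gather/scatter tokens so each expert only sees its own tokens.
        text_out = self.text_expert(hidden)
        item_out = self.item_expert(hidden)
        mask = is_item.unsqueeze(-1)  # broadcast over the hidden dimension
        return torch.where(mask, item_out, text_out)


if __name__ == "__main__":
    ffn = TokenTypeMoEFFN(d_model=768, d_ff=3072)
    hidden = torch.randn(2, 5, 768)
    # The mask could come from a vocabulary-range test, e.g. token_id >= text_vocab_size
    # if item IDs are appended to the text vocabulary (an assumption for this sketch).
    is_item = torch.tensor([[0, 0, 1, 1, 0], [1, 0, 0, 1, 1]], dtype=torch.bool)
    print(ffn(hidden, is_item).shape)  # torch.Size([2, 5, 768])
```

Because the gate is a deterministic function of token type rather than a learned router, each expert updates only on its own modality, which is one simple way to realize the "less entanglement" between text and catalog tokens that the abstract refers to.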
Primary Area: other topics in machine learning (i.e., none of the above)
Submission Number: 6134