Steering Merged LLMs for Multilingual Reasoning with Coefficient Optimization

18 Sept 2025 (modified: 18 Nov 2025)ICLR 2026 Conference Withdrawn SubmissionEveryoneRevisionsBibTeXCC BY 4.0
Keywords: multilingual reasoning, LLM, steering, model merging
TL;DR: Steering Merged LLMs for Multilingual Reasoning with Coefficient Optimization
Abstract: Model merging has recently proven effective in enhancing the cross-lingual capabilities of reasoning models by integrating them with multilingual language models. However, existing methods typically apply a uniform merging strategy across languages, leading to a trade-off: while low-resource languages may benefit, high-resource languages such as English often suffer performance degradation. We attribute this limitation to insufficient coordination between the multilingual and reasoning models, where suboptimal representation merging impairs generalization. To mitigate this, we introduce **SteMerger**, a preference-driven framework that dynamically **steers** the model **merger** by optimizing the merging coefficients. Experiments on multilingual reasoning benchmarks show that SteMerger consistently improves performance across a wide range of languages, outperforming several strong baselines.
Primary Area: applications to computer vision, audio, language, and other modalities
Submission Number: 10691
Loading