EWoRA: Expert Weighted Low-Rank Adaptation for Reasoning over Heterogeneous Data

Harsh Kohli; Helian Feng; Lenon Minorics; Bhoomit Vasani; Xin He; Ali Kebarighotbi

EWoRA: Expert Weighted Low-Rank Adaptation for Reasoning over Heterogeneous Data

Harsh Kohli, Helian Feng, Lenon Minorics, Bhoomit Vasani, Xin He, Ali Kebarighotbi

Published: 16 Oct 2025, Last Modified: 10 Nov 2025NeurIPS 2025 ER WorkshopEveryoneRevisionsBibTeXCC BY 4.0

Keywords: LoRA, Parameter Efficient Finetuning, low-rank adaptation, PEFT

TL;DR: The paper proposes a new LoRA variant for efficient finetuning which works better for heterogenous or diverse finetuning data.

Abstract: Low-Rank Adaptation (LoRA) has emerged as a widely adopted parameter-efficient fine-tuning (PEFT) approach for language models. By restricting weight updates to a low-rank subspace, LoRA achieves cost-effective finetuning of large, generalist models to more specialized target domains. While LoRA achieves impressive results for a variety of individual downstream tasks, it struggles to capture the diverse expertise needed when presented with a more heterogeneous finetuning corpus. To address this, we propose Expert Weighted Low-Rank Adaptation (EWoRA), a novel LoRA variant that partitions a rank-(r) adapter into (n) independent adapters of rank (r/n). A lightweight “routing” matrix $(\mathbf{W}_r \in \mathbb{R}^{r \times n})$ aggregates the outputs of these adapters by learning specialized weights for each context. Experiments show EWoRA improves performance over LoRA when finetuning on heterogeneous data while generally matching or exceeding LoRA performance on individual finetuning tasks under the same low-rank parameter budget.

Submission Number: 90

Loading