As a prominent Parameter-Efficient Fine-Tuning (PEFT) method, LoRA is widely used to fine-tune large language models (LLMs) efficiently. However, LoRA uniformly inserts trainable modules at target modules across all layers, which often leaves many of these modules redundant; we contend that pruning them can further improve the efficiency of PEFT. To address this issue, we propose Gradient-Guided Redundancy Reduction ($\mathcal{G}^2\mathcal{R}^2$), a novel module-level approach that adaptively prunes redundant LoRA modules, boosting fine-tuning efficiency while preserving or even improving performance. Specifically, $\mathcal{G}^2\mathcal{R}^2$ evaluates the contribution and redundancy of each trainable module with a Gradient-Based Redundancy Evaluation score that leverages gradient information. Guided by this score, $\mathcal{G}^2\mathcal{R}^2$ progressively eliminates redundant LoRA modules through a Three-Stage Redundancy Reduction Strategy. Extensive experiments on commonsense reasoning and natural language understanding tasks demonstrate that $\mathcal{G}^2\mathcal{R}^2$ improves fine-tuning efficiency while matching or surpassing state-of-the-art methods.
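The abstract does not specify the exact form of the Gradient-Based Redundancy Evaluation score, so the sketch below substitutes a common first-order (Taylor) importance heuristic, $\sum_\theta |\theta \cdot \partial\mathcal{L}/\partial\theta|$ aggregated over each module's low-rank matrices, purely to illustrate how gradient information can rank LoRA modules for pruning. The names `LoRALinear` and `redundancy_scores` are hypothetical, not from the paper.

```python
# Illustrative sketch only: scores each LoRA module by the sum of
# |theta * dL/dtheta| over its A/B matrices as a proxy for contribution.
# This is an assumed heuristic, not the paper's exact score.
import torch
import torch.nn as nn


class LoRALinear(nn.Module):
    """Frozen linear layer with a trainable low-rank (A, B) adapter."""

    def __init__(self, in_features: int, out_features: int, rank: int = 8):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        for p in self.base.parameters():  # freeze the pretrained weights
            p.requires_grad_(False)
        self.lora_A = nn.Parameter(torch.randn(rank, in_features) * 0.01)
        # Standard LoRA zero-inits B; randomized here so the toy demo
        # produces nonzero scores after a single backward pass.
        self.lora_B = nn.Parameter(torch.randn(out_features, rank) * 0.01)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + x @ self.lora_A.T @ self.lora_B.T


def redundancy_scores(model: nn.Module) -> dict[str, float]:
    """Aggregate |theta * grad| per LoRA module; low scores suggest redundancy."""
    scores: dict[str, float] = {}
    for name, module in model.named_modules():
        if isinstance(module, LoRALinear):
            s = 0.0
            for p in (module.lora_A, module.lora_B):
                if p.grad is not None:
                    s += (p.detach() * p.grad).abs().sum().item()
            scores[name] = s
    return scores


# Toy usage: two stacked LoRA layers, one backward pass, then rank modules.
model = nn.Sequential(LoRALinear(16, 16), LoRALinear(16, 4))
x, y = torch.randn(32, 16), torch.randn(32, 4)
loss = nn.functional.mse_loss(model(x), y)
loss.backward()
for name, score in sorted(redundancy_scores(model).items(), key=lambda kv: kv[1]):
    print(f"{name}: {score:.4f}")  # lowest-scoring modules appear first
```

A progressive, multi-stage reduction strategy such as the one the paper names would then repeatedly prune the lowest-scoring modules between training phases, rather than pruning once; the single-pass ranking above only illustrates the scoring step.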