Keywords: LLM, Model Merging, DARE, TIES, LoRA
Abstract: With the increasing cost of training large language models, model merging is attracting attention. This report describes our efforts in this area during the NeurIPS 2024 LLM Merging Competition. We developed differentiable DARE-TIES, which optimizes the merging parameters in a differentiable manner. Whereas existing methods rely on black-box optimization algorithms, our method uses gradient descent and is expected to optimize high-dimensional merging parameters more efficiently. We conducted experiments to examine the potential of our approach.
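The abstract only sketches the idea, so the following is a minimal illustrative sketch, not the authors' implementation: it shows how DARE-style drop-and-rescale and TIES-style sign election can be applied while keeping the merging coefficients in the computation graph, so they can be tuned by gradient descent on a validation loss instead of a black-box optimizer. All names (`dare_ties_merge`, the toy vectors, the drop rate `p`, the MSE stand-in loss) are assumptions for illustration, and TIES' magnitude-trimming step is omitted for brevity.

```python
# Hedged sketch of differentiable DARE/TIES-style merging with learnable coefficients.
# This is NOT the competition entry's code; it only illustrates the general idea.
import torch

torch.manual_seed(0)


def dare_ties_merge(base, task_vectors, weights, p=0.5):
    """Merge task vectors into the base parameters.

    DARE step: randomly drop a fraction p of each task vector's entries and
    rescale the survivors by 1/(1-p). TIES-style step: elect a per-entry sign
    from the weighted sum and keep only contributions that agree with it.
    The merge weights stay in the computation graph, so a loss on the merged
    parameters can be backpropagated to them.
    """
    sparsified = []
    for tv in task_vectors:
        mask = (torch.rand_like(tv) > p).float()   # DARE: random drop
        sparsified.append(tv * mask / (1.0 - p))   # rescale survivors
    stacked = torch.stack(sparsified)              # (num_models, dim)
    weighted = weights.view(-1, 1) * stacked
    elected_sign = torch.sign(weighted.sum(dim=0))          # sign election
    agree = (torch.sign(stacked) == elected_sign).float()   # keep agreeing entries
    merged_delta = (weighted * agree).sum(dim=0)
    return base + merged_delta


# Toy setup: a base parameter vector and two "fine-tuned" task vectors.
dim = 64
base = torch.randn(dim)
task_vectors = [torch.randn(dim) * 0.1, torch.randn(dim) * 0.1]
# Stand-in for a validation objective (a real setup would evaluate the merged model).
target = base + 0.7 * task_vectors[0] + 0.3 * task_vectors[1]

# Learnable merging coefficients, optimized with plain gradient descent.
raw_weights = torch.zeros(len(task_vectors), requires_grad=True)
opt = torch.optim.SGD([raw_weights], lr=0.5)

for step in range(200):
    weights = torch.softmax(raw_weights, dim=0)    # positive weights summing to 1
    merged = dare_ties_merge(base, task_vectors, weights)
    loss = torch.nn.functional.mse_loss(merged, target)
    opt.zero_grad()
    loss.backward()
    opt.step()

print("learned merge weights:", torch.softmax(raw_weights, dim=0).detach())
```

In this sketch the gradient reaches the merge coefficients through the weighted task-vector sum; the random DARE mask and the sign election act as fixed multipliers within each step, which is one plausible way to keep the pipeline differentiable.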
Submission Number: 5