Interpolated Layer-Wise Merging for NeurIPS 2024 LLM Merging Competition

Published: 12 Dec 2024, Last Modified: 12 Dec 2024LMC 2024 OralEveryoneRevisionsBibTeXCC BY 4.0
Keywords: model merging, large language models
Abstract: Model merging is an emerging technique to generate models by combining multiple models, leveraging their individual strengths in a unified model. Because the configuration hyperparameters of model merge techniques have a significant effect on the merged models, optimizing them using evolutionary algorithms is promising to enhance the model merge results. In our participation in the LLM Merging Competition in NeurIPS 2024, we introduced a novel search space, interpolated layer-wise space, for optimizing merging configurations using evolutionary algorithms. This paper explores the potential for performance enhancement in merged models and the efficiency of our search space for evolutionary model merge.
Submission Number: 7
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview