InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators

Heng Yang; Ke Li

InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators

Heng Yang, Ke Li

Published: 07 Oct 2023, Last Modified: 01 Dec 2023EMNLP 2023 FindingsEveryoneRevisionsBibTeX

Submission Type: Regular Short Paper

Submission Track: Theme Track: Large Language Models and the Future of NLP

Submission Track 2: Interpretability, Interactivity, and Analysis of Models for NLP

Keywords: instruction optimization, automated instruction generation, evolutionary multi-objective optimization, language model-based operators

TL;DR: This paper propose a evolutionary multi-objective optimization-based instruction generation method

Abstract: Instruction-based language modeling has received significant attention in pretrained language models. However, the efficiency of instruction engineering remains low and hinders the development of instruction studies. Recent studies have focused on automating instruction generation, but they primarily aim to improve performance without considering other crucial objectives that impact instruction quality, such as instruction length and perplexity. Therefore, we propose a novel approach (i.e., InstOptima) that treats instruction generation as an evolutionary multi-objective optimization problem. In contrast to text edition-based methods, our approach utilizes a large language model (LLM) to simulate instruction operators, including mutation and crossover. Furthermore, we introduce an objective-guided mechanism for these operators, allowing the LLM to comprehend the objectives and enhance the quality of the generated instructions. Experimental results demonstrate improved fine-tuning performance and the generation of a diverse set of high-quality instructions.

Submission Number: 5874

Loading