Keywords: Evolutionary Learning, Optimization, NLP
TL;DR: We propose MultiGA, a genetic optimization framework that seeds populations with outputs from multiple LLMs and uses an independent evaluator, leveraging complementary strengths to achieve high accuracy across diverse tasks.
Abstract: Large Language Models (LLMs) are widely used across various research domains to tackle complex tasks, but their performance can vary significantly depending on the task at hand. Compared to fine-tuning, inference-time optimization methods offer a more cost-effective way to improve LLM output. Evolutionary algorithms can be used to refine solutions iteratively, mimicking natural selection. To the best of our knowledge, there has not been exploration on leveraging the collective capabilities of multi-source seeding for LLM-guided genetic algorithms. In this paper, we introduce a novel approach, MultiGA, which applies genetic algorithm principles to address complex natural language tasks and reasoning problems by sampling from a diverse population of LLMs to initialize the population. MultiGA generates a range of outputs from various parent LLMs, open source and closed source, and uses a neutral fitness function to evaluate them. Through an iterative recombination process, we mix and refine these generations until an optimal solution is achieved. Our results show that MultiGA converges to the accuracy of the LLM best fit for the task, and these insights lay the foundation for future research looking closer at integrating multiple LLMs for unexplored tasks in which selecting only one pre-trained model is unclear or suboptimal. We benchmark our approach using text-to-SQL code generation tasks, trip planning, GPQA benchmark for grad-level science questions, and the BBQ benchmark that measures bias in models. This work contributes to the growing intersection of evolutionary computation and natural language, highlighting the potential of biologically inspired algorithms to improve generative artificial intelligence selectivity and accuracy.
Submission Number: 241
Loading