# ES-LLM

This repository contains experimental code for fine-tuning Large Language Models (LLMs) using Evolution Strategies (ES).

This is part of the supplementary material for paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"

## Usage

There are three folders containing codes for different purposes:

- "countdown" contains the code to reproduce the experimental results related to the Countdown task. (Section 4.1)
- "conciseness" contains the code to reproduce the experimental results related to fine-tuning for conciseness (Section 4.2)
- "latest" contains the newest code that is optimized for GPU memory efficiency, supporting parallelizaion within and across GPUs.
