# ACDC

Automated Capability Discovery with Dominated Novelty Search (ACD-DNS) is an experimentation harness for evolving large language models. The system coordinates task generation, model crossover/mutation, distributed evaluation, and archive management to surface models that excel on emerging capabilities.

## Repository Layout

- `main_acd_dns.py` – Hydra-driven entry point that orchestrates the ACD-DNS optimization loop, Celery workers, and W&B logging.
- `configs/` – Hydra configuration tree for DNS, task generation, Celery infra, fractional GPU scheduling, and evolution operators.
- `ACD/` – Task generation toolkit, prompts, visualization helpers, and seed tasks used to bootstrap the active task pool.
- `dns/` – Dominated Novelty Search utilities including archive management, metrics, prompt templates, and sandbox integrations.
- `crossover/` & `mutation/` – Evolution operators for combining and perturbing model checkpoints.
- `workers/` – Celery worker implementations for model initialization, merging, and evaluation.
- `evaluation/` – Offline analysis notebooks and scripts for benchmark-style assessments of evolved models.
- `utils/` – Shared helpers for Celery setup, filesystem management, sandbox configuration, and infrastructure glue.
- `frac_start_workers.sh` – Convenience script that launches a tmux session with fractional GPU workers plus the coordinating process.

## Getting Started

1. **Install dependencies**
   ```bash
   python -m venv .venv
   source .venv/bin/activate
   pip install --upgrade pip
   pip install -r requirements.txt
   ```

2. **Configure credentials & services**
   - Provide access to model checkpoints referenced in `configs/acd_dns.yaml` (`seed_model_paths`).
   - Ensure the docker sandbox is built by running `docker build -t acd-sandbox:latest docker/sandbox/`
   - Ensure Celery broker/backends (RabbitMQ/Redis by default) are reachable or override via Hydra CLI arguments by running `bash docker_setup.sh`

3. **Ensure the relevant served models are available**
    - Embedding model: `CUDA_VISIBLE_DEVICES=0; python -m vllm.entrypoints.openai.api_server --model intfloat/e5-mistral-7b-instruct --served-model-name e5-mistral-7b-instruct --task embedding --port 8010`
    - Scientist model: `CUDA_VISIBLE_DEVICES=0,1,2,3; python -m vllm.entrypoints.openai.api_server --served-model-name Qwen/Qwen2.5-72B-Instruct --model Qwen/Qwen2.5-72B-Instruct --tensor-parallel-size 4 --port 8001 --enable-prefix-caching --gpu-memory-utilization 0.8`

    To get the address for `vllm_server_host` and `vllm_embedding_server_host` in `acd_dns.yaml`, run `ip a` on the node with the running big model vLLM instance, and get the IP address from second group of rows with "enp0s12" in "inet".

4. **Launch distributed workers and training**
   Set `NUM_WORKERS` in `frac_start_workers.sh` depending on the number of requested GPUs (one worker per GPU for 7B models)
   Use `frac_start_workers.sh` to spawn Celery workers with fractional GPU assignments
   ```bash
   ./frac_start_workers.sh --gpus=2 --workers-per-gpu=2 --session=acd_dns
   ```

## Additional Resources

- `global_task_pool_eval.py` offers utilities for re-evaluating archived models against global task pools.
- `requirements.txt` enumerates Python dependencies for both orchestration and evaluation flows.

## Acknowledgments

This repository is under active development and is as a demonstration for the reproducibility of the ACDC paper.