# Experiments on Modern LLMs

## Install

```bash
sudo apt install -y rdma-core libibverbs1 libmlx5-1 libibverbs-dev
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```

If `torchmonarch` is not available on your platform, build it from source:

```bash
bash build_monarch.sh
```

## Data

- Contexts: `contexts_modified.jsonl` (or `contexts.jsonl`)
- Optional eval parquet for cross-entropy: update the path in `new_run.sh`

## Run (end-to-end)

Edit the paths and model lists at the top of `new_run.sh`, then run one experiment as follows:

```bash
bash new_run.sh
```

## Run (individual experiments)

```bash
python generate_prompts.py --model_generating_concept MODEL --models MODEL_A MODEL_B --concepts C1 C2 --out_dir prompts --seed 4
python generate_steering_vectors.py --models MODEL_A MODEL_B --in_dir prompts --save_dir steering_vectors --seed 4
python generate_plot_data.py --models MODEL_A MODEL_B --steer_dir steering_vectors --contexts_file contexts_modified.jsonl --out_dir plot_data --seed 4 --layers 3
```

Optional scripts:

```bash
python generate_margin.py --models MODEL_A MODEL_B --prompts_dir prompts --out_dir margins
python generate_cross_entropy.py --models MODEL_A MODEL_B --steer_dir steering_vectors --eval_parquet /path/to/data.parquet --out_dir cross_entropy --seed 4 --layers 3
python generate_mmlu.py --models MODEL_A MODEL_B --tasks HIGH_SCHOOL_COMPUTER_SCIENCE --steer_dir steering_vectors --out_dir mmlu --seed 4
python generate_baheviour.py --models MODEL_A MODEL_B --judge_model MODEL --steer_dir steering_vectors --contexts_file contexts_modified.jsonl --out_dir behaviour_data --alpha_start -40 --alpha_end 40 --alpha_steps 41 --n_samples_per_context 12 --max_new_tokens 100 --layers 2 4 5 7 9 11 --gen_context_batch_size 12
```
