LoRA Perturbations
==================

This repository contains experiments and plotting utilities for studying how
low-rank perturbations (LoRA-style updates) affect a pretrained language model.
The primary model used throughout is `Qwen/Qwen3-0.6B`. The codebase includes:

- Perturbation experiments that inject low-rank noise into linear layers and
  measure loss/perplexity changes.
- LoRA fine-tuning experiments to quantify forgetting and compare against full
  fine-tuning.
- Plotting utilities and analysis notebooks for generating figures.
- Precomputed plots and summary artifacts under `results/` and `sweep_1614967/`.


Quickstart
----------

1) Create an environment (Python 3.11+ required):

```
python -m venv .venv
source .venv/bin/activate
pip install -e .
```

If you prefer `uv`, you can also use the existing lock file:

```
uv sync
```

2) Run a perturbation grid search:

```
python experiments/perturbation/grid_search.py \
  --ranks 4 8 16 128 256 \
  --trials 25 \
  --output-json results/perturbation/perturbations.json \
  --output-plot results/perturbation/perturbations.png
```

3) Plot results (original scatter or binned mean +/- std):

```
python experiments/perturbation/plot_results.py \
  results/perturbation/perturbations.json \
  --output results/perturbation/perturbations.png \
  --no-show
```

```
python experiments/perturbation/plot_results_exploration.py
```


Repository Layout
-----------------

Top-level scripts and assets:

- `diagrams.py`: Generates schematic figures saved as PDFs/PNGs.
- `download_cache.py`: Utility for prefetching model/dataset assets.
- `sweep.slurm`: Example SLURM script for running sweeps on a cluster.
- `schematic_*.pdf`, `schematic_*.png`: Pre-rendered schematic figures.

Experiments:

- `experiments/perturbation/`
  - `grid_search.py`: Runs repeated low-rank perturbations and logs metrics.
  - `evaluation_runner.py`: Core utilities used by perturbation experiments.
  - `evaluate.py`: Thin wrapper around `evaluation_runner.py`.
  - `plot_results.py`: Standard scatter plot renderer.
  - `plot_results_exploration.py`: Exploratory binned-mean plot and analysis.
  - `grid_search_overview.md`: Notes on the perturbation workflow and formulas.
- `experiments/lora/`
  - `lora_finetune.py`: LoRA fine-tuning with periodic evaluation.
  - `run_lora_sweep.py`: Sweeps LoRA ranks and aggregates metrics.
  - `plot_eval_vs_heldout.py`, `plot_loss_ratio_vs_steps.py`,
    `plot_lora_sweep_norms.py`, `compute_lora_eigen_alignment.py`:
    Plotting and analysis tools for LoRA sweeps.
  - `chat_trained_model.py`: Helper for chatting with a trained adapter/model.

Results:

- `results/perturbation/`: Example perturbation logs and plots.
- `sweep_1614967/`: Example LoRA sweep outputs and plots.


Perturbation Experiments
------------------------

The perturbation pipeline draws low-rank Gaussian bases for each linear layer,
rescales the combined perturbation to a target Frobenius norm, applies it to the
model, and records the resulting loss and perplexity.

Key entry points:

- `experiments/perturbation/grid_search.py`: main experiment driver
- `experiments/perturbation/plot_results.py`: scatter plot
- `experiments/perturbation/plot_results_exploration.py`: binned mean +/- std

Default dataset:

- `wiki40b`, config `fr`, split `test`, text column `text`


LoRA Fine-Tuning Experiments
----------------------------

The LoRA pipeline fine-tunes `Qwen/Qwen3-0.6B` and measures forgetting on a
held-out evaluation dataset.

Key entry points:

- `experiments/lora/lora_finetune.py`
- `experiments/lora/run_lora_sweep.py`

Example sweep:

```
python experiments/lora/run_lora_sweep.py \
  --r-values 4 8 16 32 \
  --output-root results/lora/sweeps \
  --run-tag sweep_example \
  --save-adapter
```


Notes and Tips
--------------

- These experiments download large models and datasets. A GPU is recommended.
- Set `--device cpu` for CPU runs (slow) or `--device cuda` for GPU.
- The plotting scripts expect JSON logs produced by the corresponding runner.
- If you add new runs, keep outputs under `results/` for consistency.


License
-------

This repository does not currently include a license file.
