# SmolLM3 KataGo Fine-Tuning on Modal

This directory fine-tunes `unsloth/SmolLM3-3B-Base` with LoRA on the dataset stored in the Modal volume:

- data volume: `katago-large-datasets`
- dataset path: `katago_large_dataset_v1/train.jsonl` and `katago_large_dataset_v1/eval.jsonl`
- output volume: `katago-smollm-finetunes`

Run the GPU smoke test:

```bash
python3 -m modal run katago/katagolarge/smolLM/modal_smollm_finetune.py
```

Run full training:

```bash
python3 -m modal run katago/katagolarge/smolLM/modal_smollm_finetune.py --mode train
```

Generate explanations for 100 random eval positions:

```bash
python3 -m modal run katago/katagolarge/smolLM/modal_smollm_finetune.py --mode generate
```

The Modal entrypoint uses an A10G, `transformers==4.53.0`, 4-bit loading, bf16 compute, and LoRA adapters. Checkpoints and metrics are saved under `/outputs/<run_name>` in the `katago-smollm-finetunes` Modal volume.
