# CMI Loss Implementation

This repository contains the code for our paper on CMI (Conditional Mutual Information) Loss, a novel approach to improve AI safety through regularization during supervised fine-tuning.

## Repository Structure

```
code/
├── train/           # Training implementation with CMI Loss
│   └── cmi-loss-opensource/
│       ├── cmi_loss/    # Core CMI Loss implementation
│       ├── data/        # Training data samples
│       └── others/      # other useful code
│
└── eval/            # Evaluation benchmarks
    ├── reasoning_benchmark/    # Mathematical reasoning evaluation
    │   ├── datasets/          # AIME, MATH test sets
    │   └── sampler/          # Model inference utilities
    │
    └── safe_and_or/    # Safety evaluation
        ├── dataset/    # Safety benchmark datasets
        └── eval.py     # Safety evaluation scripts
```

## Note

This is a reference implementation. Full experiments require large-scale computational resources. See individual README files in each directory for detailed information.
