# PyTorch implementation of **IDLM: Inverse-distilled Diffusion Language Models**

This repository contains a PyTorch implementation of the paper **"IDLM: Inverse-distilled Diffusion Language Models"**
## Project structure

- `Duo/`  
  Distillation code for **Duo** and **MDLM**.

- `Score-Entropy-Discrete-Diffusion/`  
  Distillation code for **SEDD** (Score-Entropy Discrete Diffusion).

## Distillation notes

### Duo + MDLM distillation (`Duo/`)
The distillation implementation for **Duo** and **MDLM** is based on the official Duo repository.

To train **IDLM distillation** using Duo/MDLM:

1. **Use the `Duo/` folder**
   - All training and distillation code lives inside `Duo/`.

2. **Create the same environment as Duo**

3. **Configure dataset cache path**
   - Open:
     `configs/data/openwebtext-split.yaml`
   - Fill in `cache_dir` with the dataset path:
     `cache_dir: "path-to-dataset"`
#### Distill **MDLM**
To distill **MDLM**, edit:
- `scripts/adversarial_distill_mdlm.sh`


Fill in the following fields:
- `eval.checkpoint_path='path-to-checkpoint'`
- `logger.name=name-of-exp`
- Run:
```bash
sh scripts/adversarial_distill_mdlm.sh
```

#### Distill **Duo**
To distill **Duo**, edit:
- `scripts/adversarial_distill.sh`

Fill in the following fields:
- `eval.checkpoint_path='path-to-checkpoint'`
- `logger.name=name-of-exp`
- Run:
```bash
sh scripts/adversarial_distill.sh
```

### SEDD distillation (`Score-Entropy-Discrete-Diffusion/`)
To distill **SEDD**:


1. **Go to the `Score-Entropy-Discrete-Diffusion/` directory**
- All training and distillation code for SEDD lives here.


2. **Install the SEDD environment**


3. **Set the teacher checkpoint path**
- Open: `Score-Entropy-Discrete-Diffusion/configs/config.yaml`
- Fill in:
```yaml
distill_model: "path-to-checkpoint"
```


4. **Set the experiment output directory / name**
- Open: `Score-Entropy-Discrete-Diffusion/absorbing_train.sh`
- Fill in:
```bash
hydra.run.dir=name-of-exp
```


5. **Start training**
- Run:
```bash
sh absorbing_train.sh
```
