# Readme

This repository contains the code for the paper *Exploring the Trade-off between Quality and Diversity of Language Models during Reinforcement Learning*, under review at ICLR 2026.

## Dependencies

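The following packages are required (`multiprocessing` is also used, but it ships with the Python standard library and needs no separate installation):

```text
pytorch
tqdm
tensorboard
rdkit
PyTDC
```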
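
Before running the scripts, it can help to check that the dependencies are importable. The sketch below is not part of the repository. It assumes the usual import names for the listed packages (`torch` for PyTorch, `tdc` for PyTDC), and the helper `missing_modules` is a hypothetical name introduced here for illustration.

```python
import importlib.util

# Assumed import names for the dependencies listed above:
# PyTorch is imported as `torch`, PyTDC as `tdc`.
REQUIRED = ["torch", "tqdm", "tensorboard", "multiprocessing", "rdkit", "tdc"]


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    print("Missing:", ", ".join(missing) if missing else "none")
```

Any name reported as missing needs to be installed before running the pretraining or finetuning scripts.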

## Running

**Pretraining** the language model on molecules:

```bash
python pretraining.py
```

The pretraining dataset is available at [ChEMBL](https://www.ebi.ac.uk/chembl/).

**RL finetuning** with a molecular generation objective:

```bash
# Default JNK3 objective
python rl-finetuning.py
# Other objectives (QED for example)
python rl-finetuning.py --oracle QED
```
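
To inspect generated molecules outside the training loop, an objective can also be evaluated directly on SMILES strings. The following is a minimal sketch, not the repository's own scoring code. It assumes the `--oracle` names correspond to PyTDC oracle names (such as `JNK3` and `QED`), and `load_tdc_oracle` and `rank_by_score` are hypothetical helpers introduced for illustration.

```python
from typing import Callable, List, Sequence, Tuple


def load_tdc_oracle(name: str = "JNK3"):
    """Load a PyTDC oracle by name.

    Assumption: the objective names accepted by rl-finetuning.py
    match PyTDC oracle names. Requires PyTDC to be installed.
    """
    from tdc import Oracle  # imported lazily so the rest works without PyTDC

    return Oracle(name=name)


def rank_by_score(
    smiles: Sequence[str], oracle: Callable[[str], float]
) -> List[Tuple[str, float]]:
    """Score each SMILES string with `oracle` and sort from best to worst."""
    scored = [(s, float(oracle(s))) for s in smiles]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

With PyTDC installed, `rank_by_score(["CCO", "c1ccccc1"], load_tdc_oracle("QED"))` would return the molecules ordered by QED score. Any callable that maps a SMILES string to a number can be passed in place of the PyTDC oracle.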

