***Dependency***
```
python==3.9
pytorch==1.12.1
pytorch-lightning==1.9.4
rdkit==2021.09.5
transformers==4.26.1
```

***Dataset***

Place the data in {dataset}/train.txt

The first row of {dataset}/train.txt should be "SMILES description".

From the second row, place {SMILES} "The molecule is a [S*]" for each SMILES string of molecules.

***Hierarchical molecular inversion***
```
python himol.py {dataset} {output_folder} {num_epochs} {num_samples}
```
For example,
```
python himol.py hiv hiv_out 1000 1232
```


***Embedding interplation-based sampling***

Before generation, move modeling_t5.py to /in/your/environment/transformers/models/t5/modeling_t5.py

```
python generation.py
```

