# SeeTok

## Install

```
pip install -r requirement.txt
```


## Dataset Preparation
We train on OpenHermes 2.5, an open dataset of synthetic data for generalist LLM assistants (2023).
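
The snippet below is a minimal sketch of one way to fetch the data and inspect a record. It is not taken from this repository. It assumes the dataset is pulled from the Hugging Face Hub under the ID `teknium/OpenHermes-2.5` with the `datasets` library, and that each record stores its dialogue in a ShareGPT-style `conversations` list of `{"from", "value"}` turns. The helper names `load_openhermes` and `to_chat_messages` are hypothetical, and the format expected by `train_custom_qwen.sh` may differ.

```python
from typing import Dict, List

# Assumed mapping from ShareGPT-style speaker tags to chat roles.
ROLE_MAP = {"system": "system", "human": "user", "gpt": "assistant"}


def to_chat_messages(conversations: List[Dict[str, str]]) -> List[Dict[str, str]]:
    """Convert ShareGPT-style turns into a list of role/content messages.

    Turns whose speaker tag is not in ROLE_MAP are skipped.
    """
    messages = []
    for turn in conversations:
        role = ROLE_MAP.get(turn["from"])
        if role is None:
            continue
        messages.append({"role": role, "content": turn["value"]})
    return messages


def load_openhermes(split: str = "train"):
    """Download the dataset from the Hub (assumed ID; requires `datasets`)."""
    from datasets import load_dataset  # imported lazily so the helper above has no dependency

    return load_dataset("teknium/OpenHermes-2.5", split=split)
```

Under these assumptions, `to_chat_messages(load_openhermes()[0]["conversations"])` would return the first dialogue as a list of messages that a chat template can consume.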


## Pre-training
`bash train_custom_qwen.sh`

## Evaluation
`bash qwen_qa_cepe.sh`


