# ThinkTime Warm-Up SFT Training
This repo includes the source code for Warm-Up SFT training of ThinkTime-14B with the implementation of RL for iTCoT.

## Installation
1. Run `pip3 install -r requirements.txt`

## Steps to Reproduce
1. Make sure that you have follow the previous steps in [ThinkTime Folder](../ThinkTime/README.md). All the datasets for Warm-Up SFT are successfully generated.
2. Set the dataset path in `data/dataset_info.json`
3. Set the model path params in `scripts/train_thinktime.sh`. (Make sure that you have downloaded the ChatTS-14B-0801 model)
4. Run `bash scripts/train_thinktime.sh`

## Reference
This code is built on LLaMA-Factory (https://github.com/hiyouga/LLaMA-Factory) and ChatTS-Training. We will comply with the relevant license requirements and open-source the code after acceptance of this paper.
