# Project README

This repository contains the following directories:

- **SFT** – Supervised Fine-Tuning implementation.
- **DPO** – Direct Preference Optimization code.
- **eval** – Evaluation scripts and metrics.  
- **process_data** – Data preprocessing utilities.  


## Data Note

Due to file size limitations, this repository includes only a small set of demo data.  
We plan to open-source the full dataset in the near future. Stay tuned!