# S^2-Bench  

Our benchmark is at ```./data/benchmarks/open_generation```.    
Our instruction tuning dataset is at ```./data/instruction_tuning```.  

## Train (Finetune) model:  
see  
```
train_copy.sh
```  

## Test model:  
1. Close-source model:  ```run_query_openai.sh```  
2. Open-source instruction model or fully finetuned model: ```run_query_model.sh```  
3. Finetuned model with LORA: ```run_query_model_lora.sh```