## config env
```python
conda create -n llmenv python=3.8
conda activate  llmenv
pip install -r requirements.txt
```

## Usage


```python
python -u train_policy.py --datasets=advbench --env-name=exp_name --target_model=lmsys/vicuna-7b-v1.3 &> vicuna7b_train.log
python -u generate_resp_analysis.py --model_path=lmsys/vicuna-7b-v1.3 --RL=1 &> RL_vicuna7b_result_agent.log
```


