# Adversarial-Prompt-Translator


## Environments
* Python 3.8.8
* PyTorch 2.4.0
* transformers 4.44.2
* vllm 0.5.4
* openai 1.44.1

## Usage
To generate adversarial prompts using Llama-3.1-8B as the translator LLM on the HarmBench dataset, run:
```
translate.py --translator llama3.1-8b --dataset harmbench
```
The adversarial prompts will be saved in ```results/trans_harmbench_llama3.1-8b.json```.
