# TruthfulQA Evaluation for Qwen3-8B

This repository contains scripts to evaluate the Qwen3-8B model on several benchmarks, including TruthfulQA, MMLU, and Yourbench MMLU.

Run the evaluation with `accelerate launch`, using the command that matches your hardware.

To run on a node with 8 GPUs:
```bash
accelerate launch --config_file zero3_eight_gpu.yaml mmlu_llama_try.py
```

To run on a single GPU:
```bash
accelerate launch --config_file zero3_config.yaml --num_processes 1 single_gpu_runs.py
```
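
If you switch between machines often, the choice between the two commands above could be wrapped in a small helper. The sketch below is only an illustration: `launch_cmd` is a hypothetical function that is not part of this repository, and the rule that anything with fewer than 8 GPUs falls back to the single-GPU command is an assumption, since only the 8-GPU and 1-GPU setups are documented here. The function prints the command rather than running it, so you can inspect it first.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of this repository): print the launch
# command for a given GPU count.
# Assumption: fewer than 8 GPUs falls back to the single-GPU command.
launch_cmd() {
  local n_gpus="$1"
  if [ "$n_gpus" -ge 8 ]; then
    echo "accelerate launch --config_file zero3_eight_gpu.yaml mmlu_llama_try.py"
  else
    echo "accelerate launch --config_file zero3_config.yaml --num_processes 1 single_gpu_runs.py"
  fi
}
```

For example, `launch_cmd "$(nvidia-smi -L | wc -l)"` prints the command for the GPUs visible on the current machine, assuming `nvidia-smi` is installed.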

