# Investigating Risk-Aware Decision Making in Language Models

## Setup (with `uv` venv)

1. **Install `uv`** (if you haven't already):
   ```bash
   curl -LsSf https://astral.sh/uv/install.sh | sh
   ```
   Or visit [uv installation docs](https://docs.astral.sh/uv/getting-started/installation/) for other methods.

2. **Sync dependencies**:
   ```bash
   uv sync
   ```
   This will create a virtual environment and install all dependencies from the lockfile.

## API Keys
You need to set the following environment variables in the `.env` file as follows:
```
OPENAI_API_KEY="<YOUR_OPENAI_API_KEY>"
GEMINI_API_KEY="<YOUR_GEMINI_API_KEY>"
ANTHROPIC_API_KEY="<YOUR_ANTHROPIC_API_KEY>"
```

## Datasets
Please download the `SimpleQA (verified)` ([Haas et al., 2025](https://arxiv.org/abs/2509.07968)) dataset from [Kaggle Datasets](https://www.kaggle.com/datasets/deepmind/simpleqa-verified/data) to `./data/simpleqa_verified.csv` before running the code.

## Run the code
### Multi-Choice Questions
```bash
uv run run_mcqa.py \
    --model "gpt-4o-mini-2024-07-18" \
    --dataset "gpqa" \
    --prompt_type "risk_informing" \
    --r_cor "1" \
    --r_inc "-4" \
    --r_ref "0"
```

### Pure Gambling
```bash
uv run run_pure_gambling.py \
    --model "gpt-4o-2024-08-06" \
    --p_cor "1" \
    --p_inc "-8" \
    --p_ref "0"
```

### Open-Ended / Free-Form QA
```bash
uv run run_freeform_qa.py \
    --model "gpt-4o-mini-2024-07-18" \
    --dataset "simple_qa_verified" \
    --prompt_type "stepwise" \
    --r_cor "1" \
    --r_inc "-8" \
    --r_ref "0" \
    --grader_model "gpt-4.1" \
    --grader_base_url "https://api.openai.com/v1"
```