# Pre-requsites

Train all the models - NLU, NLG, Baseline, ProbMedTOD (Bayesnet) and RAG.
Create soft-links to relevant model checkpoints in `clients/models/` as shown below.

```
client/models/nlu.yml --> NLU config.
client/models/nlu/ --> NLU checkpoint from Unsloth training.
```

# Start OpenAI style server.

Start a new terminal and use VLLM to run Gemma3-27B model.

```bash
vllm serve "unsloth/gemma-3-27b-it-unsloth-bnb-4bit" --dtype=bfloat16 --quantization=bitsandbytes --load-format=bitsandbytes --port=8000 --host=0.0.0.0 --max-model-len=8192
```

You can also use other API endpoints as long as they are OpenAI-client compatible.

# Start Patient Server

Start a new terminal, go to `./services` and run the following commands.
```bash
export OPENAI_BASE_URL=http://localhost:8000/v1/
export OPENAI_MODEL=unsloth/gemma-3-27b-it-unsloth-bnb-4bit
export OPENAI_API_KEY=dummy123

python -u main.py --port 8001 --dst_key dst --ddx_key differential --dialog_path ../data/test.json
```
Make sure you use correct URLs and KEYs for your LLM endpoints.


# Run simulation

Start a new terminal and go to `clients`. Run the following command to start the simulation.

```bash
bash start.sh <MODEL>
```
where MODEL can be llama3, rag or bayesnet.