# TTSRouter

## Abstract

TTSRouter is a data analysis and upper-level call library. It loads and calls the underlying system **ttsrouterv-1.1** and completes the overall workflow. For more information, please refer to our paper.

## Directory Structure

```table
TTSRouter/
    cache/                      # cache for evaluation
    _plot_cumulative_reward.py
    ablation_study.py
    cache_utils.py
    calculate_reward_stats.py
    config.py
    generate_mock_results.py
    generate_query_embeddings.py
    knn_bandit.py
    lin_ucb.py
    main.py
    mixed_ucb.py
    nural_ucb.py
    precompute_embeddings.py
    random_baseline.py 
    README.md
    reward_function.py                  
    tts_contextual_bandit.py
    verify_utils.py         
```

## Environment Setup

We have already set up tts in ttsrouter-v1.1, so you can use it directly.

```bash
cd TTSRouter
conda activate tts
```

## Quick Start

1. **Prepare Action Embeddings** (optional when using provided cache):

```bash
python precompute_embeddings.py
```

2. **Run Bandit on a Dataset**:

```bash
python main.py --algorithm lin_ucb --data_path path/to/test150.jsonl --offline_embeddings cache/query_embeddings_combined.json --offline_results cache/pruned_question_trials_combined.csv --virtual_dataset
```

3. **Batch Experiments** :

```bash
python run_evaluation.py
```

## Offline vs Online Execution

- **Offline Replay**: Provide `--offline_embeddings` and `--offline_results` to reuse cached embeddings, verifier scores, and token statistics. This mode reproduces experiments deterministically without hitting APIs.
- **Online Mode**: Omit offline arguments to fetch embeddings with DashScope-compatible endpoints and execute live completions or verifications. Results are cached under `cache/query_cache.json` and `logs/<dataset>/`.

## Logging & Metrics

- Bandit trajectories: `logs/<dataset>/bandit_process_*.log` summarise per-query decisions, rewards, and accuracy diagnostics.
- Training diagnostics: CSV files (e.g., `logs/<dataset>/loss_log_*.csv`) track reward decomposition, regret, and eFLOPs for each step.
- UCB exploration traces: `logs/<dataset>/ucb_log_*.csv` record predicted value, exploration bonus, and chosen action.