# README

Online WPM allocation


## Run trends experiments

**Run all experiments**: Run ```bash run_expt.sh``` to run all experiments and generate plots.

### Individual experiments

**Trends with horizon $T$**: Run

```bash
bash run_expt.sh --weight-type [linear/geometric] --experiments T
```
or directly:
```
```python
python expt_trends_T.py --weight-type [linear/geometric] --objectives [wgk]
```
Here, objectives can be any subset of w (WPM), g (Gini), k (Kolm). For example, to run with WPM and Gini objectives and geometric weights, use:

```python
python expt_trends_T.py --weight-type geometric --objectives wg
```

To generate plots after running the experiments, run:
```bash
bash run_expt.sh --weight-type [linear/geometric] --experiments T --plot-only
```
or directly:
```python
python plot_trends_T.py --weight-type [linear/geometric] --runs-dir [dirname] --add-metadata --no-show
```

`--runs-dir` specifies the directory where the experiment runs are logged. If not provided, it defaults to the latest directory with the given `weight-type`.

`--add-metadata` adds experiment metadata to the plots.

`--no-show` saves the plots without displaying them.

**Trends with number of arms $k$**: Run

```bash
bash run_expt.sh --weight-type [linear/geometric] --experiments k
```
or directly:
```python
python expt_trends_k.py --weight-type [linear/geometric] --objectives [wgk]
```

To generate plots after running the experiments, run:
```bash
bash run_expt.sh --weight-type [linear/geometric] --experiments k --plot-only
```
```python
python plot_trends_k.py --weight-type [linear/geometric] --runs-dir [dirname] --add-metadata --no-show
```

**Trends with power value $q$ (for WPM/Kolm)**: Run

```bash
bash run_expt.sh --weight-type [linear/geometric] --experiments q
```
```python
python expt_trends_q.py --weight-type [linear/geometric] --objectives [wk]
```

To generate plots after running the experiments, run:
```bash
bash run_expt.sh --weight-type [linear/geometric] --experiments q --plot-only
```
or directly:
```python
python plot_trends_q.py --weight-type [linear/geometric] --runs-dir [dirname] --add-metadata --no-show
```

## Run UCB Experiments

```bash
python ucb_expt.py [OPTIONS]
```

### Command Line Arguments

| Argument | Type | Default | Description |
|----------|------|---------|-------------|
| `--n-arms` | int | 20 | Number of arms for bandit |
| `--n-rounds` | int | 5000 | Number of rounds per experiment |
| `--num-alloc` | int | 10 | Number of arms (k) to allocate per round |
| `--n-experiments` | int | 5 | Number of independent experiments to run |
| `--alpha` | float | 2.0 | UCB exploration parameter |
| `--seed` | int | 42 | Master random seed |
| `--pow` | str | "-inf" | Power parameter for WPM/Kolm (use '-inf' for egalitarian) |
| `--objective` | str | "wpm" | Objective function: `wpm`, `kolm`, or `gini` |
| `--no-log` | flag | False | Disable experiment logging |
| `--no-show` | flag | False | Don't display plots (just save) |

### Examples

```bash
# Run with all defaults (WPM, egalitarian welfare)
python ucb_expt.py

# Quick test run with fewer rounds
python ucb_expt.py --n-arms 10 --n-rounds 500 --num-alloc 5 --n-experiments 3 --no-show

# Gini objective with custom seed
python ucb_expt.py --objective gini --n-arms 50 --n-rounds 10000 --seed 123

# Nash welfare (power = 0)
python ucb_expt.py --objective wpm --pow 0 --n-arms 20 --n-rounds 5000

# Kolm objective (requires pow <= 0)
python ucb_expt.py --objective kolm --pow -1.0 --n-arms 30 --num-alloc 15

# Run without displaying plots (useful for remote servers)
python ucb_expt.py --no-show --no-log
```

### Power Parameter Values

The `--pow` argument controls the fairness-efficiency tradeoff for WPM and Kolm objectives:

| Value | WPM Interpretation | 
|-------|-------------------|
| `-inf` | Egalitarian (maximin) - maximize minimum utility |
| `0` | Nash welfare - geometric mean |
| `1` | Utilitarian - sum of utilities |

For Kolm, `--pow` must be ≤ 0.

-----


## File structure 

### File tree

```
├── src
│ ├── __init__.py   # makes src a package
│ ├── logger.py     # log runs
│ ├── armsets.py
│ ├── sampler.py
│ ├── swf_ucb.py
│ ├── ucb_expt.py
│ ├── gini.py
│ ├── kolm.py
│ ├── wpm.py
│ └── old_files/
├── lib
│ ├── bootstrap  
│ ├── clipboard  
│ └── quarto-html
├── README.md
├── result_presentation.qmd
├── result_presentation.html
```
**logger output**

```
├── runs/                # auto-created by logger
├── plots/               # auto-created by experiments
```

--- 

## Logging workflow

workflow has 3 layers:

1. `ucb_expt.py` — runs experiments and calls logger
2. `logger.py` — saves experiment metadata + results to JSON files under `runs/` directory
3. `analyze_runs.py` — loads those JSON files and generates plots/tables

### Usage

** note:** `--group-by` lets you compare runs along any config dimension (objective, n_arms, num_alloc, etc.).

```bash
# See a summary table of all logged runs
python analyze_runs.py --summary

# Plot cumulative regret, grouped by objective (wpm vs kolm vs gini)
python analyze_runs.py --plot-cumulative --group-by objective

# Plot per-step regret with smoothing
python analyze_runs.py --plot-perstep

# Filter to only WPM runs
python analyze_runs.py --objective wpm --plot-cumulative

# Generate a LaTeX table for your paper
python analyze_runs.py --latex

# Save plots to files (instead of just displaying)
python analyze_runs.py --plot-cumulative --save
```

#### Sample end-to-end run
```bash
# Run a quick experiment (will create runs/ and plots/ directories)
python ucb_expt.py --objective wpm --n-rounds 1000 --no-show

# Check what got logged
ls runs/
cat runs/*.json | head -50

# Run another with different objective
python ucb_expt.py --objective gini --n-rounds 1000 --no-show

# Now compare them
python analyze_runs.py --summary
python analyze_runs.py --plot-cumulative --group-by objective
```

----


## Testing

### Test Structure

```
tests/
├── __init__.py
├── conftest.py          # shared fixtures
├── test_wpm.py          # WPM SWF + solver tests
├── test_kolm.py         # Kolm SWF + solver tests  
├── test_gini.py         # Gini SWF + solver tests
└── test_integration.py  # end-to-end pipeline tests
```

#### What's Tested

| Category | Tests |
|----------|-------|
| **Mathematical correctness** | Known analytical solutions (pow=0 is geometric mean, pow=-inf is min, etc.) |
| **Invariants** | Allocations sum to `num_alloc`, probabilities in [0,1] |
| **Edge cases** | Single arm, equal utilities, `num_alloc == n_arms` |
| **Integration** | Full UCB loop runs without error, explores all arms, counts increase |
| **Reproducibility** | Same seed → same results |

<!-- TODO| **Sampler properties** | Sum preservation, binary output, marginal probabilities (statistical) | -->

#### Usage
To run test suite, enter one of the following in your terminal:
```bash
make test              # run all tests
make test-verbose      # with verbose output
make test-fast         # stop on first failure
make test-unit         # just unit tests (no integration)
make test-integration  # just integration tests
```

Or directly: `pytest -v`

----

### SWF implementations:
*  `gini.py` -- Generalized Gini Index:

  $$\begin{align*}
  M_{\text{Gini}}(\bar{\mu} \odot \bar{p}; \bar{w}) = \sum_i w_i (\mu p)_{(i)}
  \end{align*}$$

   where $(\mu p)_{(i)}$ denotes the $i$-th smallest value of $\bar{\mu} \odot \bar{p}$.

* `kolm.py` -- Kolm social welfare:

$$\begin{align*}
M_{\text{Kolm}}(\bar{\mu} \odot \bar{p}; \bar{w}, \alpha) = \frac{1}{\alpha} \log \paren{\sum_i w_i \exp(\mu_i p_i)}
\end{align*}$$

* `wpm.py` -- Weighted power mean (WPM):

$$\begin{align*}
M_{\text{WPM}}(\bar{\mu} \odot \bar{p}; \bar{w}, q) = \paren{\sum_i w_i (\mu_i p_i)^q}^{1/q} 
\end{align*}$$

where $w_i$ are such that $w_1 \geq w_2 \geq \ldots \geq w_n$.

> ** NOTE: we use `\bar{}` to indicate vector quantities because markdown Latex does not render `\mathbb{}` correctly. **


-------

# Setup 

#### install requirements

```bash 
pip install -r requirements.txt
```


#### install test dependencies

Dev dependencies include pytest, ruff, cvxpy, and ecos (used for validating solvers against LP solutions).

```bash 
pip install -e ".[dev]"
```
