# Stage-1 Probe (Task Sources, With Training)

This probe answers the question: **why can the OLL local rule work on real tasks, even if strong diagonal scalarization fails in Stage-0 random probes?**

Key idea: the sources \(s_t=u_t\odot g_t\) generated by a task are *not* isotropic. They have structure (small output dimension, correlations, etc.). Stage-1 measures hypotheses **on the actual training/evaluation source ensemble**.

## What it does

- Task: Row-by-row MNIST (sequence length 28)
- Model: `TorchLocalRuleRNN` (OLL update)
- Trains for a small number of epochs on a small subset
- Then probes a held-out batch and computes:
  - exact BPTT \(\delta_t\) via the closed-form recursion
  - OLL estimate \(\delta^e_t = (u_t\odot g_t)/(1-\lambda\odot u_t)\)
  - alignment metrics (cosine/sign), source subspace EVR, slowing-down proxies, and resolvent “simplicity” stats:
    - diagonal susceptibility fraction \(\|\mathrm{diag}(R)\|_F^2/\|R\|_F^2\)
    - rank-1 fraction \(\sigma_1(R)^2/\sum_i\sigma_i(R)^2\)

## Run

```bash
python Compare_RNN/analysis/oll_stage1_probe/stage1_probe.py
```

Smaller (fast) run:

```bash
python Compare_RNN/analysis/oll_stage1_probe/stage1_probe.py \
  --hidden 64 --epochs 1 --train-limit 2000 --test-limit 500 --probe-batch 64 \
  --gains "0.8,0.9,1.0,1.1"
```

Output: `plots/oll_stage1_probe_<timestamp>/summary.json`
