# OLL “Scalar Echo” — Axioms → Resolvent → Diagnosable Hypotheses

This note rewrites the foundation of the OLL scalar echo approximation in a **dynamical-systems / critical-phenomena** style. It enforces a strict separation:

- ✅ **Definitions / Theorems** (standard results)
- ⚠️ **Hypotheses / Approximations** (nontrivial closures that must be validated)

The target approximation is the per-unit “echo” teaching signal:

\[
\boxed{\;\delta^e_t \;\approx\; (u_t\odot g_t)\oslash(1-\lambda\odot u_t)\;}
\]

where \(u_t=\phi'(x_t)\), \(g_t=\partial L/\partial h_t\), \(\delta_t=\partial L/\partial x_t\), and \(\lambda\in\mathbb{R}^N\) is a slowly varying per-unit loop gain.

---

## Part A — Ground Truth (✅)

### A1. ✅ RNN as a driven discrete-time dynamical system
**Definition.** For a vanilla RNN
\[
x_t=W_{hh}h_{t-1}+W_{xh}I_t+b,\qquad h_t=\phi(x_t),
\]
the state update is a map \(h_t=F(h_{t-1},I_t)\).

**Condition.** If \(\phi\in C^2\), then \(F\) is \(C^2\) and admits a Taylor expansion with quadratic remainder.

**Classic citations.** Strogatz (Nonlinear Dynamics and Chaos); Kuznetsov (Applied Bifurcation Theory); Katok & Hasselblatt (Modern Theory of Dynamical Systems).

### A2. ✅ Instantaneous Jacobian / variational dynamics
**Definition.**
\[
J_t \equiv \frac{\partial h_t}{\partial h_{t-1}}=\mathrm{diag}(u_t)\,W_{hh},\qquad u_t=\phi'(x_t).
\]
Small perturbations evolve as \(\Delta h_t=J_t\Delta h_{t-1}+\mathcal{O}(\|\Delta h_{t-1}\|^2)\).

**Condition.** Local linearization is valid for sufficiently small perturbations in a neighborhood where \(J_t\) is Lipschitz.

**Classic citations.** Same as A1 (standard variational equation results).

### A3. ✅ Edge of chaos via Lyapunov spectrum
**Definition.** Along a trajectory, define Jacobian products
\[
\Phi_{t\leftarrow t-k}=J_tJ_{t-1}\cdots J_{t-k+1}.
\]
The maximal Lyapunov exponent is (informally)
\[
\lambda_{\max}=\lim_{k\to\infty}\frac{1}{k}\log \sigma_{\max}(\Phi_{t\leftarrow t-k}).
\]
Ordered: \(\lambda_{\max}<0\), chaotic: \(\lambda_{\max}>0\), critical boundary (“edge of chaos”): \(\lambda_{\max}=0\).

**Condition.** Existence of the Lyapunov spectrum for typical trajectories follows from Oseledec’s theorem under standard integrability assumptions.

**Classic citations.** Oseledec (1968); Benettin et al. (1980) / Wolf et al. (1985) for QR estimation; Sompolinsky, Crisanti, Sommers (1988) for the order–chaos transition in random networks; Bertschinger & Natschläger (2004) for computation near the transition.

### A4. ✅ Exact BPTT adjoint recurrence
Let \(L=\sum_{t=1}^T\ell_t\). Define
\[
g_t\equiv \frac{\partial L}{\partial h_t},\qquad \delta_t\equiv \frac{\partial L}{\partial x_t},\qquad s_t\equiv u_t\odot g_t.
\]
**Theorem.** The exact recursion is
\[
\boxed{\;\delta_t = u_t\odot\big(g_t+W_{hh}^\top\delta_{t+1}\big)\;} \quad (\delta_{T+1}=0).
\]
Equivalently with \(U_t=\mathrm{diag}(u_t)\) and \(B_t=U_tW_{hh}^\top\):
\[
\delta_t=s_t+B_t\delta_{t+1}.
\]

**Classic citations.** Werbos (1990); Williams & Zipser (1989).

### A5. ✅ Neumann series (resolvent expansion)
**Theorem.** If \(\rho(A)<1\), then
\[
\boxed{\;(I-A)^{-1}=\sum_{k=0}^{\infty}A^k\;}
\]
and the series converges in any induced matrix norm.

**Classic citations.** Horn & Johnson (Matrix Analysis); Golub & Van Loan (Matrix Computations).

---

## Part B — From BPTT to “Echo” (✅→⚠️)

### B1. ✅ Green-function / path expansion of BPTT
Unrolling the exact recursion gives (for finite horizon \(T\)):
\[
\delta_t=\sum_{k=0}^{T-t}\Big(\prod_{j=0}^{k-1}B_{t+j}\Big)s_{t+k}.
\]
This is the discrete-time adjoint “Green function” representation: gradients are a **forced linear system** driven by \(\{s_{t+k}\}\).

### B2. ⚠️ H\(_\mathrm{QS}\) — Quasi-static linear response (adiabatic freezing)
**Statement.** Near the stable side of criticality, there exists a correlation time \(\tau\gg 1\) (“critical slowing down”) such that over \(k\ll\tau\),
\[
B_{t+k}\approx B_t,\qquad s_{t+k}\approx s_t
\]
on the subspace that dominates the response.

**Physical basis.**
- Critical slowing down in dynamic critical phenomena: relaxation time diverges as the leading mode approaches marginal stability.  
  (Hohenberg & Halperin 1977; Goldenfeld 1992; Kuehn 2011/2015; Scheffer et al. 2009)
- Standard adiabatic/frozen-operator approximations in multiscale dynamics.  
  (Pavliotis & Stuart 2008; Kuehn 2015)

**Consequence.** In the infinite-horizon / locally stationary approximation,
\[
\delta_t\approx \sum_{k\ge 0}B_t^k s_t=(I-B_t)^{-1}s_t,\qquad \rho(B_t)<1.
\]
This is a **time-local resolvent** (instantaneous susceptibility).

### B2′. ✅ Biophysical interpretation of \(u_t\), \(\alpha\), and “slowing down”
This is where the intermediate variables can be grounded in measurable physics/neuroscience quantities:

- \(u_{t,i}=\phi'(x_{t,i})\): **instantaneous neural gain** (slope of the static input–output curve at the operating point). In spiking/GLM language this is the local gain of the nonlinearity; in physiology it is modulated by background drive and neuromodulators (“gain modulation”).
- \(\alpha_i\) from an AR(1) fit \(h_{t+1}^{(i)}\approx \alpha_i h_t^{(i)}\): **intrinsic timescale** proxy. In discrete time, \(\alpha_i\approx e^{-\Delta t/\tau_i}\) so \(\tau_i\sim 1/(1-\alpha_i)\) as \(\alpha_i\uparrow 1\). This is the standard “critical slowing down / early warning” observable in time series.
- \(\tau_\mathrm{int}\) (integrated autocorrelation time) of a macroscopic observable (e.g. \(e_t=\|h_t\|^2/N\)) is a classic **relaxation time** proxy in statistical physics; it increases as the leading mode approaches marginality.
- The product \((\lambda_i u_{t,i})\) in the OLL echo has the meaning of a **local loop gain / local branching ratio**: it is the per-step multiplicative factor that turns a unit’s own “source” into a returned “echo” along its recurrent feedback. Criticality corresponds to loop gain \(\approx 1\) (marginality), exactly as in feedback control and in branching-process models of neural criticality.

Classic citations for slowing-down observables: Hohenberg & Halperin (1977); Goldenfeld (1992); Scheffer et al. (2009); Kuehn (2011/2015).
Classic citations for neural “branching ratio” framing: Beggs & Plenz (2003); Shew & Plenz (2013).

### B3. ✅ Near-critical resolvent implies rank-1 susceptibility (the honest simplification)
**Fact.** If \((I-B_t)^{-1}\) has a dominant singular mode,
\[
(I-B_t)^{-1}\approx \sigma_1\,u_1 v_1^\top,
\]
then for generic sources
\[
\boxed{\;\delta_t\approx \sigma_1\,u_1\,(v_1^\top s_t)\;}
\]
which is a **rank-1 response** (projection onto the slow/critical “order parameter”).

This is the standard “single slow mode dominates the susceptibility” statement in physics (single-relaxation-time / single-pole response).

**Physical basis.** Slaving/order-parameter dominance near instability.  
(Haken 1977/1983; Carr 1981; Wiggins 2003)

### B3′. ✅ What the Stage-0 probe in this repo actually checks
`Compare_RNN/analysis/oll_stage0_probe/stage0_probe.py` measures:

- \(\lambda_{\max}(g)\) (Benettin QR) and \(\rho(B_t)\) to locate the near-critical band.
- critical slowing down via \(\mathrm{corr}(e_{t+1},e_t)\) and \(\tau_\mathrm{int}\) for \(e_t=\|h_t\|^2/N\).
- rank-1 susceptibility via the SVD of \(R_t=(I-B_t)^{-1}\):  
  \(\mathrm{EVR1}=\sigma_1^2/\sum_i\sigma_i^2\). EVR1 rising toward 1 near \(\lambda_{\max}\approx 0^-\) is the operational signature of a single dominant “order parameter” mode.

### B4. ⚠️ H\(_\mathrm{Diag}\) — From rank-1 response to per-unit scalar echo
OLL’s target is not a projection but a **per-coordinate** scalarization:
\[
\delta^e_t \approx s_t\oslash(1-\lambda\odot u_t).
\]
This requires an additional structure assumption beyond rank-1 dominance.

**One explicit form.** There exists a slowly varying \(\lambda\in\mathbb{R}^N\) such that, for the *relevant source ensemble* \(\mathcal{S}\),
\[
\boxed{\;B_t s \approx \mathrm{diag}(\lambda\odot u_t)\,s\quad\text{for typical }s\sim\mathcal{S}\;}
\]
i.e. \(B_t\) acts approximately diagonally (in the coordinate basis) on the sources that learning actually generates.

**What can make this plausible (domain-dependent).**
- Strong locality / near-diagonal effective connectivity (sparse or structured \(W_{hh}\)).
- Source–mode alignment: \(s_t\) is concentrated on a coherent mode that is also approximately coordinate-separable.
- Coarse-graining / averaging arguments where off-diagonal mixing cancels.

This is the *main* nontrivial hypothesis behind the elementwise echo.

### B4′. ✅ Physics-style local alternative: “diagonal susceptibility” as the best local approximation
Even when H\(_\mathrm{Diag}\) fails as a *property of the dynamics*, there is a principled **local** approximation:

- The true susceptibility is \(R_t=(I-B_t)^{-1}\).
- A strictly local learner cannot apply \(R_t\) (it is dense/global), but it *can* approximate with the diagonal:
  \[
  \delta^{\mathrm{diagR}}_t \equiv \mathrm{diag}(R_t)\odot s_t.
  \]

This corresponds to keeping only each neuron’s **self-susceptibility** (how its own source drives its own response), discarding cross-neuron mixing. In biophysics this is the analog of measuring the neuron’s self-response to a small perturbation (optogenetic/ICMS impulse response) and using only that local gain.

Stage-0 therefore reports the diagonal dominance ratio
\[
\frac{\|\mathrm{diag}(R_t)\|_F^2}{\|R_t\|_F^2},
\]
which empirically tends to *decrease* near criticality for dense random networks, while EVR1 increases (rank-1/global mode dominance).

### B4″. ⚠️ H\(_\mathrm{Simple}\) — The susceptibility is “simple” (diagonal self-response + dominant mode)
Stage-0 shows that **dense random** networks close to criticality often become *globally* dominated (rank-1), not diagonally dominated; this clashes with strict locality.

A hypothesis that is physics-grounded *and* compatible with local plasticity is:
\[
\boxed{\;\text{H}_{\mathrm{Simple}}:\quad R_t=(I-B_t)^{-1}\ \text{has most of its energy in}\ \mathrm{diag}(R_t)\ \text{and/or a single dominant mode.}\;}
\]

Operationally we diagnose this with two measurable quantities:

- **Diagonal susceptibility fraction**
  \[
  D_t \equiv \frac{\|\mathrm{diag}(R_t)\|_F^2}{\|R_t\|_F^2}.
  \]
- **Order-parameter (rank-1) fraction**
  \[
  E_t \equiv \frac{\sigma_1(R_t)^2}{\sum_i\sigma_i(R_t)^2}.
  \]

In practice (and in Stage-1 task training), \(D_t\) may decrease while \(E_t\) increases, yet the response remains effectively “low complexity” for learning.

**Physical basis.** As one approaches instability, response becomes dominated by slow collective modes (order parameters); away from criticality, self-response dominates. This crossover is standard in linear response / susceptibility (dynamic critical phenomena + slaving).

### B5. ✅ Diagnosable error identity (operator-level, physics-style)
Let \(D_t\equiv \mathrm{diag}(\lambda\odot u_t)\) be the diagonal surrogate. Then the resolvent identity gives
\[
(I-B_t)^{-1}=(I-D_t)^{-1} + (I-B_t)^{-1}(B_t-D_t)(I-D_t)^{-1}.
\]
Applying to \(s_t\):
\[
\boxed{\;\delta_t - (I-D_t)^{-1}s_t = (I-B_t)^{-1}(B_t-D_t)\,\delta^e_t\;}
\]
with \(\delta^e_t=(I-D_t)^{-1}s_t=s_t\oslash(1-\lambda\odot u_t)\).

Therefore,
\[
\|\delta_t-\delta^e_t\|\;\le\;\|(I-B_t)^{-1}\|\;\|B_t-D_t\|\;\|\delta^e_t\|.
\]
This is “physics-style” in the sense that it isolates:
- **susceptibility** \(\|(I-B_t)^{-1}\|\) (diverges near criticality),
- **model mismatch** \(\|B_t-D_t\|\),
- **drive size** \(\|\delta^e_t\|\).

---

## Part C — Stage 0 Validation (No Tasks / No Labels / No Training)

The goal is to falsify/validate H\(_\mathrm{QS}\) and H\(_\mathrm{Diag}\), and to quantify the rank-1 susceptibility in B3.

### C1. Setup (random RNN, driven)
- Network: tanh RNN, \(N=128\sim 256\).
- Recurrent weights: \(W_{hh,ij}\sim\mathcal{N}(0,g^2/N)\), sweep \(g\in[0.5,1.5]\).
- Input drive: \(I_t\sim\mathcal{N}(0,\sigma_I^2 I)\).
- Burn-in then probe window.

### C2. Criticality diagnostics (✅)
- Finite-time \(\lambda_{\max}\) via Benettin QR on the driven trajectory.  
  (Benettin 1980; Wolf 1985; Oseledec 1968)
- Local stability proxy: \(\rho(B_t)\) at snapshots, where \(B_t=\mathrm{diag}(u_t)W_{hh}^\top\).

### C3. H\(_\mathrm{QS}\) diagnostics (⚠️)
Critical slowing down should appear as growth of correlation time of simple observables. Example:
- \(e_t=\frac{1}{N}\|h_t\|^2\): lag-1 autocorrelation \(\mathrm{corr}(e_{t+1},e_t)\) increases.
- integrated autocorrelation time \(\tau_\mathrm{int}\) increases as \(\lambda_{\max}\uparrow 0^-\).  
(Hohenberg & Halperin 1977; Goldenfeld 1992; Scheffer 2009; Kuehn 2011/2015)

### C4. Rank-1 susceptibility (B3) diagnostics (✅/⚠️)
Compute the resolvent \(R_t=(I-B_t)^{-1}\) at snapshots and its singular spectrum:
- **EVR1** \(=\sigma_1^2/\sum_i\sigma_i^2\) (fraction of response power explained by the top mode).
- If EVR1 \(\to 1\) near criticality, the susceptibility is effectively rank-1.

### C5. H\(_\mathrm{Diag}\) diagnostics for the scalar echo (⚠️)
Use synthetic sources \(s\) (no labels):
- isotropic \(s\sim\mathcal{N}(0,I)\),
- state-coupled sources \(s=u\odot h\),
- random-readout “energy” sources \(s=u\odot (W_y^\top W_y h)\).

Fit \(\lambda\) from a source ensemble by least squares:
\[
\lambda_i \approx \frac{\mathbb{E}[s_i(B_t s)_i]}{u_{t,i}\,\mathbb{E}[s_i^2]}.
\]
Then test:
- operator mismatch: \(\|B_t s - (\lambda\odot u_t)\odot s\|/\|B_t s\|\),
- resolvent error: compare \(\delta_\text{true}=R_t s\) vs \(\delta^e=s/(1-\lambda\odot u_t)\).

### C6. Implementation in this repo
Run:

```bash
python Compare_RNN/analysis/oll_stage0_probe/stage0_probe.py --operator B
```

---

## Part D — Stage 1 Validation (Task Sources; why learning can still work)

Stage-0 is a falsification test for overly-strong claims (“diagonal scalarization must hold generically near criticality”). The learning rule, however, is driven by **task sources** \(s_t=u_t\odot g_t\), which are not isotropic.

`Compare_RNN/analysis/oll_stage1_probe/stage1_probe.py` measures, after training:
- exact BPTT \(\delta_t\) (closed-form recursion),
- OLL \(\delta^e_t\),
- and susceptibility “simplicity” via \(D_t\) (diagonal fraction) and \(E_t\) (rank-1 fraction) at a representative time step.

Empirically (row-MNIST probe runs), training can reduce \(D_t\) but increase \(E_t\); the combined response remains low-complexity while \(\delta^e_t\) stays aligned (positive cosine/sign agreement) with exact \(\delta_t\).

This supports the interpretation:
- OLL does not require “pure diagonalization” of the network response;
- it requires that the **effective susceptibility seen by task sources** remains simple (self-response + few slow modes), consistent with slaving/critical slowing down and with locality constraints.

This writes `plots/oll_stage0_probe_<timestamp>/summary.json` and diagnostic curves, including:
- `lyapunov_vs_g.png`
- `energy_tau_vs_g.png`
- `rank1_evr_vs_g.png`
- scalarization error curves vs \(g\) and vs \(\lambda_{\max}\)
