# PEPFlow — Claude Code Collaboration Guide

PEPFlow is a library for analyzing convergence of optimization algorithms via the
Performance Estimation Problem (PEP) framework. This file documents conventions and
patterns for AI-assisted analysis.

## Project Layout

```
pepflow/               Core library (functions, operators, PEP builder, solver, utils)
examples/              One subdirectory per algorithm, each containing:
    {algo}_setup.py    Reusable setup module (interface for pep_runner.py)
    {algo}_example.ipynb  Full analysis notebook
docs/source/           Sphinx documentation source
scripts/               Utility shell scripts (check.sh for lint/test)
```

## Running PEPFlow

```bash
# Activate the virtual environment (required before running any Python):
cd PEPFlow && source .venv/bin/activate

# Or prefix with the venv Python directly (no activation needed):
.venv/bin/python3
```

## Convergence Analysis Workflow

Each analysis notebook follows these phases in order:

| Phase | Description | Automatable? |
|---|---|---|
| 1 | Define function/operator | Partial (template) |
| 2 | Write `make_ctx_{algo}` encoding the update rule | Partial |
| 3 | Numerical convergence sweep (solve PEP for N=1..8, plot) | Yes |
| 4 | Verification solve at fixed N: extract λ, τ, S | Yes |
| 5 | Propose closed-form λ (dual interpolation variables) | Claude assists |
| 6 | Propose closed-form S (Gram dual matrix) | Claude assists |
| 7 | Assemble symbolic proof: verify LHS − RHS = 0 | Yes (once forms known) |

## Key PEPFlow API Patterns

### Setup (smooth convex function)
```python
L = pf.Parameter("L")
f = pf.SmoothConvexFunction(is_basis=True, tags=["f"], L=L)
ctx = pf.PEPContext("ctx_name").set_as_current()
x = pf.Vector(is_basis=True, tags=["x_0"])
f.set_stationary_point("x_star")

for i in range(N):
    x = x - 1/L * f.grad(x)   # algorithm update rule
    x.add_tag(f"x_{i+1}")
```

### Setup (monotone operator)
```python
A = pf.MonotoneOperator(is_basis=True, tags=["A"])
ctx = pf.PEPContext("ctx_name").set_as_current()
x = pf.Vector(is_basis=True, tags=["x_0"])
y = x.add_tag("y_0")
A.set_zero_point("x_star")

alpha = pf.Parameter("alpha")
for i in range(N):
    x = A.resolvent(y, alpha, tag=f"x_{i+1}")
    y = (...).add_tag(f"y_{i+1}")   # algorithm-specific update
```

### Build and solve PEP
```python
pb = pf.PEPBuilder(ctx)
pb.add_initial_constraint(
    ((ctx["x_0"] - ctx["x_star"])**2).le(R, name="initial_condition")
)
pb.set_performance_metric(f(ctx[f"x_{N}"]) - f(ctx["x_star"]))
result = pb.solve(resolve_parameters={"L": sp.S(1), "R": sp.S(1)})
print(result.opt_value)
```

### Extract dual variables
```python
lamb_sol = result.get_scalar_constraint_dual_value_in_numpy(f)  # or A
S_sol    = result.get_gram_dual_matrix()
tau_sol  = result.dual_var_manager.dual_value("initial_condition")

lamb_sol.pprint()   # display as LaTeX matrix
S_sol.pprint()
```

### Evaluate closed-form candidates
```python
pm = pf.ExpressionManager(ctx, resolve_parameters={"L": sp.S(1)})

# Evaluate a pf.Scalar expression to a numpy matrix in the inner-product basis:
S_guess_eval = pm.eval_scalar(S_guess_expr).inner_prod_coords
print(np.allclose(S_guess_eval, S_sol.matrix, atol=1e-4))

# Evaluate a pf.Vector to coordinate vector:
v_coords = pm.eval_vector(ctx["x_1"]).coords
```

### Symbolic proof assembly
```python
# Assemble weighted sum of interpolation inequalities:
interp_sum = pf.Scalar.zero()
for tag_i, tag_j in itertools.product(row_names, col_names):
    if lamb(tag_i, tag_j) != 0:
        interp_sum += lamb(tag_i, tag_j) * f.interp_ineq(tag_i, tag_j)

# Display symbolically (human-readable, NOT sympy-parseable — see warning below):
diff = LHS - interp_sum + S_guess
pf.pprint_str(diff.repr_by_basis(ctx, sympy_mode=True, resolve_parameters={"L": sp.S("L")}))

# Verify numerically (the authoritative check):
pm = pf.ExpressionManager(ctx, resolve_parameters={"L": sp.S(1)})
diff_matrix = pm.eval_scalar(diff).inner_prod_coords
print("Proof valid:", np.allclose(diff_matrix, 0, atol=1e-6))
```

### repr_by_basis output is NOT sympy-parseable

`diff.repr_by_basis(..., sympy_mode=True)` returns a human-readable Unicode string using
⟨·,·⟩ for inner products and |·|² for norms. Passing it to `sp.sympify()` always raises
`SyntaxError`. The correct pattern is:

```python
# Display only — not parseable by sympy:
pf.pprint_str(diff.repr_by_basis(ctx, sympy_mode=True, resolve_parameters={...}))

# Numerical zero-check via ExpressionManager (authoritative):
pm = pf.ExpressionManager(ctx, resolve_parameters=params_sp)
diff_matrix = pm.eval_scalar(diff).inner_prod_coords   # → numpy matrix
print("Proof valid:", np.allclose(diff_matrix, 0, atol=1e-6))
```

### Parameter precision

`pf.Parameter` values passed as Python floats (e.g., `beta=0.3333`) become sympy `Float`
objects, which accumulate floating-point error in exact-arithmetic proofs. For clean
closed-form work, pass sympy Rationals directly or use string params with `pep_runner.py`:

```python
# In notebooks — pass sympy Rationals:
params_sp = {"L": sp.S(1), "beta": sp.Rational(1, 3)}

# In pep_runner.py CLI — use string values for exact fractions:
# --params '{"L": 1, "R": 1, "beta": "1/3"}'
# The runner converts strings via sp.Rational(), floats via sp.nsimplify(..., rational=True)
```

## pep_runner.py — Command-Line Interface

`pepflow/pep_runner.py` solves a PEP from the terminal and returns JSON output.
Used by the `/pep-analyze` slash command to automate the numerical phases.

```bash
# Solve for fixed N, get full dual variable data:
.venv/bin/python3 -m pepflow.pep_runner \
    --module examples/{ALGO_NAME}/{ALGO_NAME}_setup.py \
    --N 4 \
    --params '{"L": 1, "R": 1}' \
    --output /tmp/results.json

# With relaxed constraints (constraints to exclude from the dual):
.venv/bin/python3 -m pepflow.pep_runner \
    --module examples/{ALGO_NAME}/{ALGO_NAME}_setup.py --N 4 \
    --params '{"L": 1, "R": 1}' \
    --relaxed '["f:x_0,x_2", "f:x_1,x_3"]'
```

Output JSON fields:
- `N`, `opt_value`, `tau_sol`
- `lambda_matrix`, `lambda_row_names`, `lambda_col_names`
- `S_matrix`, `S_row_names`, `S_col_names`
- `basis_vectors`

The setup module must define `get_pep_setup(N, params) -> (ctx, pb, obj)` with the
initial condition and performance metric already set on `pb`. Use `ctx_name=f"ctx_{N}"`
to allow repeated calls (e.g., in a numerical sweep loop) without registry conflicts.

## Reference Examples

| Algorithm | Problem type | Reference notebook |
|---|---|---|
| Gradient Descent | smooth convex | `examples/_references/gd/gd_example.ipynb` |
| Optimized Gradient Method | smooth convex | `examples/ogm/ogm_example.ipynb` |
| Accelerated Gradient Method | smooth convex | `examples/agm/agm_example.ipynb` |
| Proximal Gradient Method | composite | `examples/pgm/pgm_example.ipynb` |
| APPM / Optimized Halpern | monotone operator | `examples/_references/appm/appm_example.ipynb` |
| Douglas-Rachford Splitting | monotone operator | `examples/drs/drs_example.ipynb` |

## Automated Analysis: /pep-analyze

The `/pep-analyze` slash command (in `PEPFlow-codework/.claude/commands/`) runs the
full 8-phase workflow automatically. Invoke it from Claude Code:

```
/pep-analyze "Heavy Ball method: x_{k+1} = x_k - (1/L)*grad_f(x_k) + beta*(x_k - x_{k-1}) on L-smooth convex functions"
```

Phases: parse → setup module → numerical sweep → verification solve →
closed-form λ → closed-form S → symbolic proof → notebook generation.
