# Static Source-Structure and Mathlib-Leverage Proxy

This note summarizes a static source proxy for the submitted FormalSLT snapshot. It is meant to help reviewers calibrate whether the artifact is mostly isolated files, mostly direct Mathlib calls, or a library with internal reuse. It is not a proof of mathematical novelty and should not be read as a causal measurement of agent productivity.

Command:

```bash
python3 scripts/analyze_source_structure.py \
  --tarball formalslt_anonymized.tar.gz \
  --json-out SOURCE_STRUCTURE_ANALYSIS.json
```

Headline counts:

| Quantity | Value |
|---|---:|
| Lean source files including `FormalSLT.lean` | 46 |
| Source lines | 20,080 |
| Comment-skipped theorem/lemma declarations | 412 |
| Mathlib import lines | 174 |
| Internal `FormalSLT` import lines | 112 |
| Declarations with at least one local theorem/lemma reference | 257 / 412 = 62.4% |

By chain:

| Chain | Files | Source lines | Theorems | Lemmas | Mathlib imports | Internal imports | Declarations with local refs |
|---|---:|---:|---:|---:|---:|---:|---:|
| Azuma | 9 | 2,392 | 13 | 45 | 16 | 14 | 26 |
| Covering | 4 | 6,460 | 101 | 33 | 13 | 6 | 98 |
| PAC-Bayes | 4 | 2,022 | 33 | 1 | 14 | 5 | 27 |
| PAC-VC | 6 | 1,107 | 12 | 11 | 13 | 10 | 13 |
| Probability | 3 | 527 | 16 | 0 | 18 | 0 | 0 |
| Rademacher | 13 | 4,050 | 37 | 35 | 78 | 23 | 40 |
| Risk/ERM spine | 5 | 1,193 | 17 | 4 | 15 | 50 | 14 |
| Stability | 2 | 2,329 | 38 | 16 | 7 | 4 | 39 |

Method note:

- Declaration spans run from each theorem/lemma line to the next top-level declaration.
- Internal-reference counts match local theorem/lemma names textually inside those spans, excluding short or common names.
- This is a static proxy. It does not distinguish every Mathlib lemma call inside tactic terms, and it does not prove that a declaration is mathematically novel.
- The useful signal is compositional structure: 62.4% of theorem/lemma declarations mention at least one theorem or lemma defined in the submitted source snapshot, so the artifact is not just a flat set of isolated wrappers around imports.

## Local-Reference Graph Companion

For a stricter source-level reuse check, run:

```bash
python3 scripts/analyze_local_reference_graph.py \
  --tarball formalslt_anonymized.tar.gz \
  --json-out LOCAL_REFERENCE_GRAPH_ANALYSIS.json \
  --markdown-out LOCAL_REFERENCE_GRAPH_ANALYSIS.md \
  --matrix-csv LOCAL_REFERENCE_CHAIN_MATRIX.csv \
  --fig-dir figures
```

This builds a conservative theorem/lemma graph where an edge `A -> B` is added
only when declaration `A` textually references the unique local declaration name
for `B`. The submitted snapshot gives 463 static local-reference edges, 257
declarations with textual local-reference edges, 295 declarations textually
referenced by at least one other declaration, a largest weak component of 177
declarations, and maximum static local-reference depth 18.

This remains a source proxy. It should not be described as Lean kernel
dependency extraction, theorem-difficulty measurement, novelty evidence, or
agent-productivity measurement.
