# Autoformalization Route Audit

Last checked: 2026-05-08.

Reviewer MCP artifact: <https://anonymous.4open.science/r/research-map-mcp-artifact-20260513-600C/README.md>

This supplement records a small manual audit of the fixed Lacuna reviewer route. The audit checks whether the generated direction and proposal pages preserve enough source evidence for a reader or agent to verify the path from a broad topic to a candidate research question.

## Route

1. Start with RRF search: <http://34.8.208.118/md/render/search?q=automated%20theorem%20proving%20proof%20assistants>
2. Open direction: [The Autoformalization Gap in Theorem Proving](http://34.8.208.118/md/render/direction/the-autoformalization-gap-in-theorem-proving-23293)
   - Served count: 369 papers, 762 concepts.
   - Reviewer route: <http://34.8.208.118/md/render/direction/the-autoformalization-gap-in-theorem-proving-23293>
3. Select related direction: [Iterative Compiler Feedback for Formal Theorem Proving](http://34.8.208.118/md/render/direction/iterative-compiler-feedback-for-formal-theorem-proving-12169)
   - Served count: 119 papers, 310 concepts.
   - Reviewer route: <http://34.8.208.118/md/render/direction/iterative-compiler-feedback-for-formal-theorem-proving-12169>
4. Inspect paper pages for source evidence:
   - [ProofNet](http://34.8.208.118/md/render/paper/proofnet-autoformalizing-and-formally-proving-undergraduate-level-mathematics/art_f5a5f3551f6641598e578328d5771b3b)
   - [FIMO](http://34.8.208.118/md/render/paper/fimo-a-challenge-formal-dataset-for-automated-theorem-proving/art_d65417c966cc4525ab8bd63248394361)
   - [StepFun-Prover](http://34.8.208.118/md/render/paper/stepfun-prover-preview-let-s-think-and-verify-step-by-step/art_804f191b837940a1a2b73568d30d29b0)
   - [VERINA](http://34.8.208.118/md/render/paper/verina-benchmarking-verifiable-code-generation/art_ae1f0fb500a44279ba999f4c6df4ed6e)
   - [No LLM Solved Yu Tsumura's 554th Problem](http://34.8.208.118/md/render/paper/no-llm-solved-yu-tsumura-s-554th-problem/art_4376e8c651d648baa4a26450cc60b311)
5. End at proposal page: [Token-Level Alignment of Informal Mathematics to Formal Compiler States](http://34.8.208.118/md/render/hypothesis/token-level-alignment-of-informal-mathematics-to-formal-compiler-states-5f6a6e04c0bdbac4)
   - Reviewer route: <http://34.8.208.118/md/render/hypothesis/token-level-alignment-of-informal-mathematics-to-formal-compiler-states-5f6a6e04c0bdbac4>
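
For an agent that needs to replay the route, the non-paper steps can be written down as an ordered list of URLs. The following is a minimal sketch, assuming only the URL patterns visible in the links above; the helper names (`search_url`, `direction_url`, `hypothesis_url`) and the `ROUTE` structure are illustrative and are not part of Lacuna or its API. The paper pages from step 4 are left out for brevity, and nothing here makes a network request.

```python
from urllib.parse import quote

# Host and path prefix copied from the route URLs in this audit.
BASE = "http://34.8.208.118/md/render"


def search_url(query: str) -> str:
    """Build the search URL; quote() percent-encodes spaces as %20."""
    return f"{BASE}/search?q={quote(query)}"


def direction_url(slug: str) -> str:
    """Build a direction page URL from its slug (hypothetical helper)."""
    return f"{BASE}/direction/{slug}"


def hypothesis_url(slug: str) -> str:
    """Build a proposal page URL from its slug (hypothetical helper)."""
    return f"{BASE}/hypothesis/{slug}"


# Ordered route: (page kind, URL). Slugs are copied verbatim from the list above.
ROUTE = [
    ("search", search_url("automated theorem proving proof assistants")),
    ("direction", direction_url("the-autoformalization-gap-in-theorem-proving-23293")),
    ("direction", direction_url("iterative-compiler-feedback-for-formal-theorem-proving-12169")),
    ("hypothesis", hypothesis_url(
        "token-level-alignment-of-informal-mathematics-to-formal-compiler-states-5f6a6e04c0bdbac4"
    )),
]

if __name__ == "__main__":
    for step, (kind, url) in enumerate(ROUTE, start=1):
        print(f"{step}. [{kind}] {url}")
```

Keeping the route as data rather than prose makes it easy to diff against a later re-check of the same pages.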

## Claim Checks

| Generated claim or route claim | Evidence inspected | Audit status | Notes |
|---|---|---|---|
| Autoformalization is hard because informal mathematics leaves assumptions, types, and proof steps implicit, while proof assistants require explicit formal structure. | Autoformalization direction; ProofNet page. | Supported. | The direction frames this as a mismatch between compressed human mathematical writing and strict formal languages. The ProofNet page gives concrete evidence through Lean typechecking, implicit hypotheses, and retrieval-based formalization. |
| Compiler feedback can turn formal theorem proving from one-shot generation into iterative correction. | Iterative compiler-feedback direction; FIMO; StepFun-Prover; VERINA. | Supported. | The direction and paper pages describe loops where a model proposes formal steps, a compiler or verifier returns errors, and the model revises. FIMO reports improved formalization from compiler feedback; VERINA shows feedback helps but remains costly and incomplete. |
| Iterative feedback alone does not solve deeper logical or proof-generation gaps. | VERINA; No LLM Solved Yu Tsumura's 554th Problem. | Supported limitation. | The pages report that repeated feedback improves some outputs but still fails on harder proof obligations, especially where models lack the needed theorem-proving strategy. |
| Token-level alignment of informal mathematics to formal compiler states is a validated result. | Token-level alignment proposal; FIMO; compiler-feedback direction. | Not supported as a result. | The page is a generated research proposal. It should be presented as a candidate problem formulation surfaced by navigation, not as an established finding. |
| The route is perfectly clean and contains only theorem-proving papers. | Direction context and related papers from the live API. | False; useful failure case. | The route includes some adjacent or noisy papers from broader verifiable-reasoning and tool-feedback neighborhoods. Lacuna exposes this noise through paper-level pages and relation tabs, but the user still has to select and audit the relevant branch. |
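
The table above can also be kept in a machine-readable form so that an agent can separate supported claims from those that need a caveat. This is a rough sketch, assuming a record layout of our own choosing: the `ClaimCheck` class, its field names, and the shortened claim labels are hypothetical, while the evidence lists and status strings are copied from the table.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class ClaimCheck:
    """One row of the claim-check table (illustrative layout, not a Lacuna schema)."""
    claim: str                  # shortened label for the audited claim
    evidence: tuple[str, ...]   # pages inspected
    status: str                 # audit status as written in the table


CHECKS = [
    ClaimCheck("autoformalization is hard",
               ("Autoformalization direction", "ProofNet"),
               "Supported"),
    ClaimCheck("compiler feedback enables iterative correction",
               ("Iterative compiler-feedback direction", "FIMO", "StepFun-Prover", "VERINA"),
               "Supported"),
    ClaimCheck("feedback alone does not close deeper gaps",
               ("VERINA", "No LLM Solved Yu Tsumura's 554th Problem"),
               "Supported limitation"),
    ClaimCheck("token-level alignment is a validated result",
               ("Token-level alignment proposal", "FIMO", "compiler-feedback direction"),
               "Not supported as a result"),
    ClaimCheck("route contains only theorem-proving papers",
               ("Direction context and related papers from the live API",),
               "False; useful failure case"),
]

# Tally of audit statuses across all rows.
status_counts = Counter(check.status for check in CHECKS)

# Rows whose status does not begin with "Supported" must not be cited as findings.
needs_caveat = [check for check in CHECKS if not check.status.startswith("Supported")]

if __name__ == "__main__":
    for status, count in status_counts.items():
        print(f"{count} x {status}")
    for check in needs_caveat:
        print(f"caveat: {check.claim}")
```

Filtering on the status prefix is a deliberate simplification; a stricter version would use an enumerated status field instead of free text.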

## Outcome

The audit supports the paper's bounded claim: Lacuna makes a generated research route inspectable, source-linked, and agent-readable. It does not certify that every generated synthesis page is correct. In this route, the final proposal is useful precisely because the source-linked navigation makes clear which claims are supported by prior work and which remain open hypotheses.
