# Definition of Done

Every task completed by the Ramanujan Agent must meet the criteria below before it is considered **done**. These criteria exist to prevent the most common failure modes: formulas that look correct but aren't, code that works on one example but fails at scale, and results that can't be reproduced.

---

## 1. Scientific Request

A scientific request is any task that produces a mathematical result: a new formula, an identity, a proof sketch, a convergence analysis, or an investigation of a conjecture.

### Deliverables

- [ ] **LaTeX summary document** using the template in `templates/scientific_report.tex`
  - **Bottom line first**: One-paragraph executive summary stating the main result.
  - **Formal statement**: The formula, identity, or theorem stated precisely with all notation defined.
  - **Examples**: At least 2 worked numerical examples demonstrating the result.
  - **Verification**: Numerical verification to **100+ decimal places** using `mpmath`, with the verification code included.
  - **Context**: How the result relates to known CMFs, PCFs, or prior work.
  - **Open questions**: What remains unproven or unexplored.

### Verification Criteria

- [ ] Every formula computed numerically and matched against a reference constant to 100+ digits.
- [ ] Edge cases tested: $n=0$, $n=1$, boundary values.
- [ ] If a convergence rate is claimed, it is measured empirically (digits per term at depth 100, 500, 1000).
- [ ] Cross-validated against `ramanujantools` library output where applicable.
- [ ] All code used for verification is included and runnable.

### Not Done If

- Any formula is only symbolically derived but not numerically verified.
- The LaTeX document compiles with errors.
- Examples are stated but not computed.

---

## 2. Code Development

A code development task produces new or modified Python code: a function, a module, an optimization, a C extension, a script, or a bug fix.

### Deliverables

- [ ] **Working code** that passes all tests.
- [ ] **Tests** — see [Coverage Policy](COVERAGE_POLICY.md):
  - Every new public function has at least one test.
  - Tests cover: normal operation, edge cases, and at least one known-answer verification.
  - Tests are runnable via `pytest`.
- [ ] **Guardrails** built into the code:
  - Input validation at public API boundaries (type checks, range checks, dimension checks).
  - Precision guards: `mpmath.mp.dps` set appropriately, no silent use of `float`.
  - Assertions for invariants that should never be violated (e.g., matrix determinant, recurrence order).
- [ ] **Sanity checks** that run as part of the function or test:
  - Known-answer test: compute a well-known constant and verify to expected precision.
  - Consistency check: forward and inverse operations compose to identity.
  - Scale check: if the function is meant to work at $N=10000$, test it at $N=10000$.
- [ ] **Docstring** for every new public function: one-line summary, parameters, return value, example.

### Performance Tasks (additional criteria)

- [ ] Benchmark before and after, with wall-clock times reported.
- [ ] Profile output included showing the hot path.
- [ ] Tested at realistic scale (not just toy inputs).

### Not Done If

- Any test fails.
- New public functions lack tests.
- Code uses Python `float` for mathematical computations.
- No guardrails on public API inputs.

---

## 3. Formula Discovery

A formula discovery task searches for new polynomial continued fractions, new CMF trajectories, or new representations of constants.

### Deliverables

- [ ] **The formula** stated precisely: polynomial coefficients, recurrence, or CMF trajectory.
- [ ] **Numerical verification** to 100+ digits at depth ≥ 1000.
- [ ] **Convergence rate**: digits per term, measured empirically.
- [ ] **Irrationality measure $\delta$** if applicable (via `cmf.delta()`).
- [ ] **Classification**: which CMF family does it belong to? Is it coboundary-equivalent to a known formula?
- [ ] **LaTeX summary** with the formula, verification, and classification.
- [ ] **Reproducible code** that generates the formula and verifies it.

### Not Done If

- The formula matches fewer than 100 digits.
- No convergence rate is reported.
- The formula is not classified against known CMFs.

---

## 4. Paper Writing

A paper writing task produces or edits LaTeX content for a research paper.

### Deliverables

- [ ] LaTeX that **compiles without errors** (`pdflatex` + `biber`).
- [ ] Consistent notation per the paper's notation table.
- [ ] All claims backed by either a proof, a citation, or a numerical verification.
- [ ] Cross-references use `\cref{}`.
- [ ] New references added to `references.bib` with descriptive keys.
- [ ] Figures generated from code (reproducible), not hand-drawn.

### Not Done If

- LaTeX compilation fails.
- Any claim is made without supporting evidence.
- Notation conflicts with the paper's existing conventions.

---

## 5. Bug Fix / Investigation

### Deliverables

- [ ] **Root cause identified** and explained.
- [ ] **Fix implemented** with a regression test that would have caught the bug.
- [ ] **No other tests broken** by the fix.

### Not Done If

- The root cause is not understood (a "fix" that works by coincidence is not done).
- No regression test is added.

---

## Universal Rules (Apply to All Task Types)

1. **Run the code.** Never declare done without executing the code and seeing the output.
2. **State the bottom line first.** The most important result goes in the first sentence of any deliverable.
3. **Include reproducibility information.** Python version, package versions, commands to run.
4. **Flag uncertainties.** If something is conjectured but not proven, say so explicitly.
5. **Update tests.** If your change could break existing tests, run the full suite and fix any failures.

---

## 6. Symbolic Verification (Multi-Point Walk)

When verifying symbolic expressions (CMF walk matrices, formula transcriptions):

### Deliverables

- [ ] **Multi-point substitution**: Test symbolic M_P(n) at n = 0, 1, 2, 3, 4 (minimum 5 substitution points).
- [ ] **Never accept 0-digit verification**: If a formula gives 0 correct digits, investigate K-convention and A/B swap before declaring failure.
- [ ] **K-convention check**: Test both K and 1/K. Papers may use 1/pi = K * sum (RISC) vs K/pi = sum (our code).
- [ ] **A<->B swap check**: Test all 4 combinations {(A,B,K), (B,A,K), (A,B,1/K), (B,A,1/K)} before declaring a transcription broken.
- [ ] **mpmath precision**: Never use float() on mpmath values. Keep all arithmetic in mpmath.mpf.
- [ ] **Algebraic z0 handling**: Report "n/a" for pFq walk when z0 is algebraic, but still verify via direct summation.

### Not Done If

- A symbolic expression is verified at only 1 substitution point.
- A formula is declared broken without testing all 4 A/B x K convention combinations.
- Digit counts truncate to 0 due to premature float() conversion.

---

## 7. Deliverable Sync Checklist

Every completed task must sync results to ALL destinations:

- [ ] **Paper sections** synced from paper/ to Ramanujan1914FormulasByCMF/sections/
- [ ] **Math scripts** synced to Ramanujan1914FormulasByCMF/math_scripts/
- [ ] **Overleaf repo** committed and pushed to GitHub
- [ ] **Colab notebook** updated with results, tables, and summary
- [ ] **DEFINITION_OF_DONE.md** updated with task status
- [ ] **Skills** updated if new methodology was developed

### Not Done If

- Changes exist in paper/ but are not synced to the Overleaf folder.
- Colab notebook does not reflect the latest results.
- DEFINITION_OF_DONE.md is stale.
