# Condition B: Sequential PDF-to-Chat Baseline

You are running a timed baseline trace for a paper evaluation.

## Task

Starting from the topic `automated theorem proving proof assistants`, produce the same output as the Lacuna condition:

1. one scoped research question;
2. three supporting source-linked observations;
3. one limitation or failure mode;
4. a concise route summary.

## Baseline Constraint

Do not use Lacuna pages, Lacuna `/md` routes, or Lacuna-generated summaries.

This baseline approximates a PDF-to-chat workflow where the agent must summarize each academic paper in turn before deciding whether it helps the route. To make the run reproducible, use the following fixed source-paper order. For each item:

1. fetch the public source paper or source article;
2. for arXiv PDFs, extract and read the first 10 pages with `pdftotext`;
3. for the StepFun source article, extract as much main article text as is available from the public page;
4. write a Lacuna-style markdown note before moving to the next source;
5. decide whether it supports the emerging research question;
6. only then move to the next item.

Use Bash with `curl -L -sS`, `pdftotext`, and ordinary shell tools. You may write temporary files in the current working directory. Keep extracted text bounded with `head -c 50000`.

## Source-Paper Order

- ProofNet PDF: `https://arxiv.org/pdf/2302.12433`
- FIMO PDF: `https://arxiv.org/pdf/2309.04295`
- StepFun-Prover source article: `https://stepfun.ai/research/en/stepfun-prover-preview`
- VERINA PDF: `https://arxiv.org/pdf/2505.23135`

If a source page or PDF is unavailable, record the failure and continue to the next source page. Do not replace it with a Lacuna page.

For each arXiv PDF, use a command of this form before summarizing it:

```bash
curl -L -sS -o proofnet.pdf https://arxiv.org/pdf/2302.12433
pdftotext -f 1 -l 10 -layout proofnet.pdf - | head -c 50000
```

After reading each source, immediately create a markdown note with a Bash here-doc before fetching the next source. Use these filenames:

- `proofnet_note.md`
- `fimo_note.md`
- `stepfun_note.md`
- `verina_note.md`

Each note should be 650-900 words and structured like a compact Lacuna paper page:

```markdown
# Paper Title

## Summary
...

## Method / System
...

## Key Results
...

## Limitations And Open Questions
...
```

Do not include the full note text in the final JSON; the trace will capture that you generated it.

## Output Format

Return only valid JSON with these fields:

```json
{
  "condition": "pdf_to_chat_baseline",
  "route": [{"url": "...", "reason": "..."}],
  "pages_read_count": 4,
  "notes_written": [{"paper": "...", "file": "...", "approx_words": 750, "decision": "use|skip|unclear"}],
  "source_checks": [{"claim": "...", "checked_against": "...", "status": "supported|proposal_level|limitation|unclear"}],
  "final_question": "...",
  "observations": ["...", "...", "..."],
  "limitation": "...",
  "blocking_model_calls_needed": 6,
  "notes": "..."
}
```

`blocking_model_calls_needed` should be a conservative estimate for an ordinary sequential PDF-to-chat workflow: one call to plan plus one call for each paper summary before final synthesis.
