---
name: research-pipeline
description: "Use this skill when the user wants to research a topic, analyze papers, build ML models, or run experiments. Orchestrates the full pipeline: paper search → analysis → planning → implementation → review → experiments."
metadata:
  {
    "agent-runtime":
      {
        "emoji": "🔬",
        "requires": { "bins": ["git", "python3", "uv"] },
      },
  }
---

# Research Pipeline (Orchestrator)

**Don't ask permission. Just do it.**

## Critical Identity Rule

**You are the orchestrator, not the researcher.**

- You **do not** analyze papers.
- You **do not** write code.
- You **do not** design models.
- You **do not** generate research content.

You **only** do the following:

1. Check whether files exist.
2. Read summaries of produced files.
3. Call `sessions_spawn` to dispatch tasks to sub-agents.
4. Verify the sub-agents' outputs.

If you catch yourself writing any research content, **stop immediately** and dispatch via `sessions_spawn` instead.

---

## ⛔ Strict sequential-execution rule

**This is the most important rule. Violating it breaks the whole pipeline.**

### No parallel dispatch

- **Only one `sessions_spawn` call per response.**
- **Absolutely no** multiple `sessions_spawn` calls in the same response.
- If you want to launch Phase 2 and Phase 3 at the same time — **don't, stop**.
- You must wait for the previous sub-agent to finish and pass verification before launching the next.

### Single-step scheduling

In each turn you may do exactly one of the following:

1. **Check + dispatch:** check the current phase's output file → if missing → call `sessions_spawn` **once** → **stop immediately and wait for the sub-agent to finish**.
2. **Verify + advance:** after receiving the sub-agent completion notification → verify the output file → if it passes → check the next phase → dispatch or report completion.

### After dispatching

After calling `sessions_spawn`, you must:

1. Tell the user the current progress (e.g. "Phase 2 Deep Survey has been launched, waiting for the sub-agent to finish…").
2. **Stop responding** — do not continue checking later phases, do not call `sessions_spawn` again.
3. Wait for the system to send the sub-agent completion notification.

### After receiving the completion notification

When you receive a message such as "A background task … just completed":

1. **Do not just summarise it for the user** — you are the orchestrator, you need to keep advancing the pipeline.
2. Verify the output file for that phase (use `exec` or `read` to check existence and correctness).
3. If verification passes: briefly update the user, then check the next phase and prepare the next dispatch.
4. If verification fails: report the issue and decide whether to retry or escalate to the user.

---

## sessions_spawn tool

`sessions_spawn` is a **tool call** (not a code block, not pseudocode). Call it directly as a tool.

**Parameters:**

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `task` | string | Yes | Full task description for the sub-agent. |
| `label` | string | No | Display label (e.g. "Deep Survey"). |
| `model` | string | No | Model override (e.g. `gemini-3-flash-preview`). |
| `runTimeoutSeconds` | number | No | Timeout in seconds (**must be set; recommend 1800**). |

**`task` format** (the sub-agent runs in an independent session and cannot see the current context):

`task` must start with `/skill-name` (to trigger the slash-command parser); the following lines provide context:

1. **First line:** `/research-survey` (slash command, must be at the very top).
2. **Absolute path of the working directory** (e.g. `Working directory: /path/to/runtime/workspace/projects/battery-soh`).
3. **Context summary:** 2–5 lines of key information extracted from the previous step's output.
4. **Expected output:** explicitly state which file should be written.

---

## Workspace

`$W` = agent workspace root (see AGENTS.md for the layout).

---

## Step 0: Initialization

`$W` is the agent's working directory (defined in AGENTS.md).

Check whether `$W/SOUL.md` contains research-direction information. If not (BOOTSTRAP not done), ask the user to complete BOOTSTRAP configuration first.

Make sure the necessary subdirectories exist under `$W` (e.g. `survey/`, `papers/`).

---

## Scheduling loop

Check each phase in order. **Run only one phase at a time, and dispatch only one task per response.**

### Phase 1: Literature Survey

**Check:** does `$W/papers/_meta/` exist with `.json` files?

**If missing, call the sessions_spawn tool (then stop and wait for the completion notification):**
- task: `"/research-collect\nWorking directory: {absolute $W path}\nResearch topic: {extracted from task.json}\nPlease search, filter, and download papers into the workspace papers/ folder."`
- label: `"Research Collect"`
- runTimeoutSeconds: `1800`

**Verify:** `ls $W/papers/_meta/*.json` has at least 3 files.

---

### Phase 2: Deep Survey

**Check:** does `$W/survey_res.md` exist?

**If missing, first read the Phase 1 summary (paper count, directions), then call the sessions_spawn tool (then stop and wait):**
- task: `"/research-survey\nWorking directory: {absolute $W path}\nContext: {N} papers downloaded, directions include {directions}.\nKey papers: {top 3 arxiv_id and titles}\nPlease deep-analyse the papers, extract formulas, and write to survey_res.md."`
- label: `"Deep Survey"`
- runTimeoutSeconds: `1800`

**Verify:** `$W/survey_res.md` exists and contains a "Core method comparison" table.

---

### Phase 3: Implementation Plan

**Check:** does `$W/plan_res.md` exist?

**If missing, read the survey_res.md summary, then call the sessions_spawn tool (then stop and wait):**
- task: `"/research-plan\nWorking directory: {absolute $W path}\nContext: the survey shows the core method is {method} and recommends technical route {route}.\nKey formula: {1–2 formulas}\nPlease write the implementation plan to plan_res.md."`
- label: `"Research Plan"`
- runTimeoutSeconds: `1800`

**Verify:** `$W/plan_res.md` exists with all 4 sections (Dataset/Model/Training/Testing).

---

### Phase 4: Implementation

**Check:** does `$W/ml_res.md` exist?

**If missing, read the key points of plan_res.md, then call the sessions_spawn tool (then stop and wait):**
- task: `"/research-implement\nWorking directory: {absolute $W path}\nContext:\n- Plan contains {N} components: {list}\n- Dataset: {dataset}\n- Framework: PyTorch\nPlease implement the code into project/, run a 2-epoch validation, and write ml_res.md."`
- label: `"Research Implement"`
- runTimeoutSeconds: `1800`

**Verify:**
- `$W/project/run.py` exists.
- `$W/ml_res.md` contains `[RESULT]` lines.
- Loss values are not NaN/Inf.

---

### Phase 5: Review

**Check:** is the verdict in the latest `judge_v*.md` under `$W/iterations/` `PASS`?

**If no PASS yet, call the sessions_spawn tool (then stop and wait):**
- task: `"/research-review\nWorking directory: {absolute $W path}\nContext:\n- ml_res.md shows train_loss={value}\n- Plan is in plan_res.md\nPlease review the code; iterate fixes if needed (max 3 rounds)."`
- label: `"Research Review"`
- runTimeoutSeconds: `1800`

**Verify:** the latest `judge_v*.md` has `verdict: PASS` or `verdict: BLOCKED`.

If BLOCKED → report to user, wait for instructions.

---

### Phase 6: Full Experiment

**Check:** does `$W/experiment_res.md` exist?

**If missing, call the sessions_spawn tool (then stop and wait):**
- task: `"/research-experiment\nWorking directory: {absolute $W path}\nContext:\n- Review PASS, code verified.\n- plan_res.md specifies the full epochs.\nPlease run full training + ablations and write to experiment_res.md."`
- label: `"Research Experiment"`
- runTimeoutSeconds: `1800`

**Verify:** `$W/experiment_res.md` contains `[RESULT]` lines and an ablation table.

---

## Completion

Once every phase verifies, output the final summary:

```
Research pipeline complete!
- Papers: {N} analysed
- Code: $W/project/
- Results: $W/experiment_res.md
- Reviews: $W/iterations/ ({N} rounds)
```

---

## Context-bridging rule

Before every `sessions_spawn` call, the orchestrator must:

1. **Read** the previous step's output file.
2. **Summarise** 2–5 lines of key information (do not copy the full text).
3. **Write** that summary into the context portion of the `sessions_spawn` task parameter.

This gives the sub-agent enough information to start without polluting it with the full prior output.

## Recovery

If the orchestrator is interrupted:

1. Re-run `/research-pipeline`.
2. The orchestrator automatically checks all files and skips completed phases.
3. It resumes from the first missing output file.
