---
name: research-plan
description: "[Read when prompt contains /research-plan]"
metadata:
  {
    "agent-runtime":
      {
        "emoji": "📋",
      },
  }
---

# Research Plan

**Don't ask permission. Just do it.**

**Workspace:** `$W` = working directory provided in the task parameter.

## Prerequisites

| File | Source |
|------|--------|
| `$W/task.json` | /research-pipeline or user |
| `$W/survey_res.md` | /research-survey |
| `$W/notes/paper_*.md` | /research-survey |
| `$W/repos/` | /research-collect Phase 3 |
| `$W/prepare_res.md` | /research-collect Phase 3 |

**If `survey_res.md` is missing, STOP:** "Run /research-survey first to complete the deep analysis."

## Output

| File | Content |
|------|---------|
| `$W/plan_res.md` | Four-part implementation plan. |

---

## Workflow

### Step 1: Read context

Read the following files to understand the research goal and the technical approach:

- `$W/task.json` — research goal.
- `$W/survey_res.md` — recommended technical route, core formulas, **formula → code mapping table**, summary of the reference-code architecture.
- `$W/prepare_res.md` — list of reference repos and their key files.

### Step 2: Deep analysis of the reference code

**⚠️ Mandatory step (Novix Plan Agent mechanism)** — read the implementation details of the reference repos to make sure the plan rests on something concrete and feasible.

For the key repos in `prepare_res.md`:

1. Read the directory structure and the README.
2. Read the core model code to understand how the architecture is implemented.
3. Read the training script to understand hyperparameter choices and training tricks.
4. Read the data-loading code to understand the preprocessing pipeline.

For each component, record **reference file path + key implementation details**. This information goes directly into the "reference code" column of the plan.

### Step 3: Write the four-part plan

Write `$W/plan_res.md`:

```markdown
# Implementation Plan

## 1. Dataset Plan

- **Dataset name:** {name}
- **Source:** {URL or description}
- **Size:** {samples / size}
- **Preprocessing steps:**
  1. {step}
  2. {step}
- **DataLoader design:**
  - batch_size: {value}
  - Input format: {shape}
  - Output format: {shape}

## 2. Model Plan

- **Architecture overview:** {1-2 sentences}
- **Component list:**

| Component | Formula | Reference code | Input → Output |
|-----------|---------|----------------|----------------|
| {component} | $formula$ | `repos/xxx/file.py` | {shape} → {shape} |

- **Parameter count estimate:** {approximate}

## 3. Training Plan

- **Loss:** {formula + description}
- **Optimizer:** {Adam/SGD/...}, lr={value}
- **Scheduler:** {if any}
- **Training parameters:**
  - epochs (validation): 2
  - epochs (full): {value}
  - batch_size: {value}
- **Metrics to monitor:** {loss, metrics to log}

## 4. Testing Plan

- **Evaluation metrics:**

| Metric | Formula / description | Expected range |
|--------|------------------------|----------------|
| {metric} | {description} | {range} |

- **Baselines:** {what to compare against}
- **Ablation studies (initial plan):**
  1. {ablation 1}
  2. {ablation 2}
```

### Step 4: Self-check

Verify the plan is complete:

- [ ] Every model component has a corresponding formula.
- [ ] **Every component has its "reference code" column filled in** (when `repos/` exists).
- [ ] The dataset has a concrete acquisition path (URL or download command).
- [ ] The loss has a mathematical definition.
- [ ] Evaluation metrics are clearly defined.
- [ ] Training parameters are reasonable (no `lr=0.1` for Adam).

For uncertain items, mark them as `⚠️ TODO: {reason}` in the plan.

---

## Rules

1. Every component in the plan must trace back to a formula or method in `survey_res.md`.
2. Do not write a "generic" plan — every parameter needs a concrete value or a reasonable estimate.
3. If reference repos exist, the component table must include reference-code paths.
4. `plan_res.md` is considered done when all four sections exist and are non-empty.
