### Filenames and Artifacts
* `codex_pilot_spec_v3`
* `codex/` (Repository Root)
* `codex/README.md`
* `codex/data/positions/`
* `codex/data/katago_cache.jsonl`
* `codex/data/splits/train.txt`
* `codex/data/splits/test.txt`
* `codex/go_engine/board.py`
* `codex/go_engine/tools.py`
* `codex/go_engine/ladder.py`
* `codex/go_engine/territory.py`
* `codex/go_engine/life_death.py`
* `codex/boardmatcher_server/server.js`
* `codex/boardmatcher_server/package.json`
* `codex/boardmatcher_server/client.py`
* `codex/katago/subprocess_manager.py`
* `codex/katago/batch_analyzer.py`
* `codex/katago/cache.py`
* `codex/training/cold_start_sft.py`
* `codex/training/grpo_train.py`
* `codex/training/reward.py`
* `codex/training/claim_parser.py`
* `codex/training/counterfactual.py`
* `codex/training/tool_env.py`
* `codex/training/config.py`
* `codex/evaluation/eval.py`
* `codex/evaluation/concept_accuracy.py`
* `codex/evaluation/tier3_accuracy.py`
* `codex/evaluation/causal_eval.py`
* `codex/prompts/pass1_system.txt`
* `codex/prompts/pass2_system.txt`
* `codex/prompts/tool_definitions.json`
* `codex/preprocess/katago_batch.py`
* `codex/preprocess/filter_positions.py`
* `codex/preprocess/generate_sft_demos.py`
* `codex/requirements.txt`

### Code Blocks and Verbatim Snippets

**Architecture Option 1 Flow**
```text
Pass 1: board → concept schema
Pass 2: concepts → PVs / ownership / score / winrate
Pass 3: concept schema + Pass 2 predictions → natural language explanation
```

**Interleaved Prose and Claim Sequence Example**
```text
The top-right corner is the critical area this move. White's group at R15
<claim id="G1.in_atari">true</claim> is in atari with
<claim id="G1.liberties">1</claim> liberty remaining. The ladder at S15
<claim id="ladder_exists">true</claim> works for Black
<claim id="ladder_favor">black</claim>, so White cannot escape by running.
This makes the position extremely sharp
<claim id="position_sharpness">sharp</claim> — Black stands to gain roughly
6 points <claim id="best_move_score_delta">6.0</claim> and holds a
<claim id="global_black_winrate">63</claim>% win rate.
```

**Board Representation**
```text
Side: Black | Komi: 7.5 | Move: 84 | Last: W R16
 A B C D E F G H J K L M N O P Q R S T
19 . . . . . . . . . . . . . . . . . . .
18 . . . B . . . . . . . . . . . . . . .
...
```

**Inline Claim Tag Format**
```text
<claim id="FIELD_NAME">VALUE</claim>
```

**Pass 1 System Prompt**
```text
You are a Go analysis engine. Given a board position, explain your analysis in natural
language and assert structured claims using inline <claim> tags.

Your explanation should proceed as follows:
- Call tools to verify tactical and evaluative facts before asserting them.
- Write one or a few sentences explaining your reasoning about a concept.
- Immediately follow with a <claim id="field_name">value</claim> tag asserting that concept.
- Continue to the next concept.

Available tools:
- Go engine: count_liberties, is_in_atari, get_group_size, is_connected_to_edge,
 check_ladder, is_ko_active, check_capture_race.
- Pattern tools: name_move, find_pattern.
- KataGo: query_katago (max_visits 100–200).

After completing your explanation and all claim tags, output nothing further.
Pass 2 will receive only your claim values.
```

**Pass 1 Expected Generation Trace**
```text
<tool_call>{"name": "query_katago", "arguments": {"max_visits": 150,
 "include_ownership": true, "include_ownership_stdev": true}}</tool_call>
<tool_response>{"rootInfo": {"winrate": 0.63, "scoreLead": 4.5,
 "scoreStdev": 18.2, "rawStWrError": 0.07, "rawVarTimeLeft": 38.0,
 "utility": 0.74},
 "moveInfos": [
 {"move": "S15", "utility": 0.74, "scoreLead": 10.5, "order": 0,
 "pv": ["S15", "T15", "T16", "S16", "R16"], "visits": 120},
 {"move": "Q15", "utility": 0.61, "scoreLead": 6.2, "order": 1,
 "pv": ["Q15", "R14", "S14"], "visits": 20}],
 "ownership": [...], "ownershipStdev": [...]}</tool_response>

<tool_call>{"name": "is_in_atari", "arguments": {"anchor": "R15"}}</tool_call>
<tool_response>true</tool_response>

<tool_call>{"name": "count_liberties", "arguments": {"anchor": "R15"}}</tool_call>
<tool_response>1</tool_response>

<tool_call>{"name": "check_ladder", "arguments":
 {"prey_anchor": "R15", "chaser_anchor": "Q15"}}</tool_call>
<tool_response>{"ladder_works": true, "favor": "black", "escapes": false}</tool_response>

<tool_call>{"name": "name_move", "arguments": {"coord": "S15"}}</tool_call>
<tool_response>"Hane"</tool_response>

This is a middlegame position with action concentrated in the top-right corner.
<claim id="game_phase">middlegame</claim>

The KataGo evaluation shows high short-term uncertainty (rawStWrError 0.07), indicating
the position is tactically critical — a mistake here would be costly.
<claim id="position_sharpness">sharp</claim>

The overall game score is contested with a scoreStdev of 18.2, suggesting neither
side has secured the game yet.
<claim id="score_uncertainty">medium</claim>

The top-right corner is the key area, with significant ownership uncertainty —
both sides can still influence the outcome there.
<claim id="focus_region">top-right</claim>
<claim id="focus_urgency">high</claim>
<claim id="focus_ownership_balance">-3.5</claim>
<claim id="focus_ownership_uncertainty">high</claim>

White's group anchored at R15 has only one liberty and is in atari. It consists
of 4 stones and has no connection to the board edge, making it vulnerable.
<claim id="G1.color">white</claim>
<claim id="G1.anchor">R15</claim>
<claim id="G1.liberties">1</claim>
<claim id="G1.in_atari">true</claim>
<claim id="G1.size">4</claim>
<claim id="G1.connected_to_edge">false</claim>
<claim id="G1.is_key_group">true</claim>

The ownershipStdev values over this group are high (mean 0.31), confirming the
group's status is genuinely unsettled — its fate determines much of the position.
<claim id="G1.ownership_confidence">contested</claim>
<claim id="G1.status">unsettled</claim>

The ladder at S15 works for Black and White cannot escape by running out.
<claim id="ladder_exists">true</claim>
<claim id="ladder_favor">black</claim>

There is no ko fight currently active, and no capture race between major groups.
<claim id="ko_active">false</claim>
<claim id="capture_race_exists">false</claim>
<claim id="capture_race_favor">none</claim>

The primary candidate S15 is a Hane, a natural shape in this configuration.
<claim id="focus_move_pattern">Hane</claim>
<claim id="focus_shape_type">null</claim>

KataGo rates S15 at utility 0.74, significantly ahead of the second option Q15
at 0.61 — a large gap suggesting S15 is clearly best here.
<claim id="best_move_utility">0.74</claim>
<claim id="utility_gap">large</claim>
<claim id="best_move_score_delta">6.0</claim>

Black leads by approximately 4.5 points with a 63% win rate globally.
<claim id="global_black_score_lead">4.5</claim>
<claim id="global_black_winrate">63</claim>
```

**Pass 2 Input (Reconstructed JSON)**
```json
{
 "game_phase": "middlegame",
 "position_sharpness": "sharp",
 "score_uncertainty": "medium",
 "focus_region": "top-right",
 "focus_urgency": "high",
 "focus_ownership_balance": -3.5,
 "focus_ownership_uncertainty": "high",
 "G1": {
 "color": "white", "anchor": "R15", "liberties": 1,
 "in_atari": true, "size": 4, "connected_to_edge": false,
 "is_key_group": true, "ownership_confidence": "contested",
 "status": "unsettled"
 },
 "ladder_exists": true, "ladder_favor": "black",
 "ko_active": false,
 "capture_race_exists": false, "capture_race_favor": "none",
 "focus_move_pattern": "Hane", "focus_shape_type": null,
 "best_move_utility": 0.74, "utility_gap": "large",
 "best_move_score_delta": 6.0,
 "global_black_score_lead": 4.5, "global_black_winrate": 63
}
```

**Pass 2 System Prompt**
```text
Given the following Go position concept analysis, predict:
1. Top-3 principal variations (PV lines), each as a move sequence of up to 8 moves.
2. Ownership in the focus region: net balance (positive = Black) and uncertainty level.
3. Global score lead (Black positive, in points).
4. Global Black win rate (0–100%).
```

**Pass 2 Expected Output**
```json
{
 "PV1": ["B S15", "W T15", "B T16", "W S16", "B R16"],
 "PV2": ["B Q15", "W R14", "B S14", "W T15"],
 "PV3": ["B R16", "W S15", "B T15", "W T16"],
 "focus_ownership": {"balance": -3.5, "uncertainty": "high"},
 "score_lead": 4.5,
 "winrate": 63
}
```

**Tool Definitions Group A: Go rules engine**
```python
def count_liberties(anchor: str) -> int
def is_in_atari(anchor: str) -> bool
def get_group_size(anchor: str) -> int
def is_connected_to_edge(anchor: str) -> bool
def check_ladder(prey_anchor: str, chaser_anchor: str) -> dict
def is_ko_active() -> bool
def check_capture_race(group1_anchor: str, group2_anchor: str) -> dict
```

**Tool Definitions Group B: Sabaki boardmatcher**
```python
def name_move(coord: str) -> str | None
def find_pattern(coord: str) -> str | None
```

**Tool Definitions Group C: KataGo live query**
```python
def query_katago(
 max_visits: int = 200,
 include_ownership: bool = True,
 include_ownership_stdev: bool = True,
 include_policy: bool = False
) -> dict
```

**KataGo Request Configuration**
```json
{
 "maxVisits": 800,
 "includeOwnership": true,
 "includeOwnershipStdev": true,
 "includeMovesOwnership": true,
 "includeMovesOwnershipStdev": false,
 "includePolicy": true,
 "includePVVisits": true,
 "analysisPVLen": 10
}
```

**Reward Formulas**
```text
R_out = 0.30×R_pv + 0.20×R_own + 0.20×R_tier3 + 0.15×R_score + 0.15×R_wr

Δ_j = R_out(original) − R_out(counterfactual_j)
R_cf = Σ_j clip(Δ_j, 0, 0.5)

R_total = R_fmt × (0.35×R_out + 0.35×R_proc + 0.30×R_cf)
```

### Data Tables

**Self-Verification Claim Type Table**
| Claim Type | Self-Verification Sufficient? | Why |
| :--- | :--- | :--- |
| Liberty/atari count | Yes, with board simulation | Mechanically decidable, no estimation error |
| Ladder/net sequence | Yes, with step-by-step tracing | Deterministic tree search |
| Group life-and-death status | Partially — easy cases yes | Hard cases require deep search; model errors correlate with generation errors |
| Territory estimate | No | Requires understanding of influence and boundary disputes, highly model-error-prone |
| "This move is urgent" / "sente" claims | No | Requires comparative win-rate evaluation across candidate moves |

**Tier 1 — Board-engine Verifiable Fields**
| Field | Type | Description |
| :--- | :--- | :--- |
| `Gk.color` | enum: `black`, `white` | Stone color |
| `Gk.anchor` | string: coordinate | Identifier stone |
| `Gk.liberties` | int 0–8 | Exact liberty count |
| `Gk.in_atari` | bool | liberties == 1 |
| `Gk.size` | int | Number of stones in group |
| `Gk.connected_to_edge` | bool | Whether stone touches board edge |
| `Gk.is_key_group` | bool | Assessment of centrality |
| `ladder_exists` | bool | Ladder fight present |
| `ladder_favor` | enum: `black`, `white`, `none` | Who wins the ladder |
| `ko_active` | bool | Ko fight active |
| `capture_race_exists` | bool | Semeai present |
| `capture_race_favor` | enum: `black`, `white`, `even`, `none` | Who wins the capture race |

**Tier 2 — boardmatcher-grounded Fields**
| Field | Type | Description |
| :--- | :--- | :--- |
| `focus_move_pattern` | string or `null` | boardmatcher `nameMove()` |
| `focus_shape_type` | string or `null` | boardmatcher `findPatternInMove()` |

**Tier 3 — KataGo-oracle-grounded Fields**
| Field | KataGo source | Description |
| :--- | :--- | :--- |
| `game_phase` | `rawVarTimeLeft` | opening, middlegame, endgame |
| `position_sharpness` | `rawStWrError` | sharp, normal, quiet |
| `score_uncertainty` | `scoreStdev` | high, medium, low |
| `focus_region` | Model-selected | Region of relevance |
| `focus_urgency` | composite | high, medium, low |
| `focus_ownership_balance` | `ownership` sum | Net ownership, bucketed to 0.5 |
| `focus_ownership_uncertainty` | `ownershipStdev` sum | high, medium, low |
| `Gk.ownership_confidence` | `ownershipStdev` mean | settled, contested |
| `Gk.status` | ownership + stdev | alive, dead, unsettled |
| `best_move_utility` | `moveInfos[0].utility` | Bucketed to 0.01 |
| `utility_gap` | utility diff | large, medium, small |
| `best_move_score_delta` | scoreLead diff | Bucketed to 0.5 |
| `global_black_score_lead` | `rootInfo.scoreLead` | Bucketed to 0.5 |
| `global_black_winrate` | `rootInfo.winrate` | Bucketed to 1% |

**Bucketing Rules**
| Field | Rule |
| :--- | :--- |
| `position_sharpness` | rawStWrError: > 0.06 → sharp, 0.02–0.06 → normal, < 0.02 → quiet |
| `score_uncertainty` | scoreStdev: > 20 → high, 10–20 → medium, < 10 → low |
| `focus_urgency` | ownershipStdev region sum > 5 and rawStWrError > 0.03 → high; either → medium; neither → low |
| `focus_ownership_uncertainty` | ownershipStdev region sum: > 5 → high, 2–5 → medium, < 2 → low |
| `Gk.ownership_confidence` | ownershipStdev mean over group: > 0.2 → contested, ≤ 0.2 → settled |
| `Gk.status` | ownership mean > 0.7 (correct color) and stdev < 0.2 → alive; mean < −0.3 → dead; otherwise unsettled |
| `utility_gap` | diff > 0.1 → large, 0.03–0.1 → medium, < 0.03 → small |
| `game_phase` | rawVarTimeLeft: > 60 → opening, 20–60 → middlegame, < 20 → endgame |

**Process Self-Reward (R_proc) Scoring Table**
| Scenario | Score |
| :--- | :--- |
| Tool called, claim matches tool response | +1.0 × weight |
| Tool not called, claim correct per post-hoc engine check | +0.5 × weight |
| Tool not called, claim incorrect | −0.5 × weight |
| Tool called, claim contradicts tool response | −1.5 × weight |
| boardmatcher called, claim matches | +1.0 × weight |
| boardmatcher called, claim doesn't match | −0.5 × weight |
| boardmatcher not called, claim matches post-hoc | +0.3 × weight |

**Field Weights (Tier 1 + Tier 2)**
| Field | Weight |
| :--- | :--- |
| `Gk.in_atari` | 0.16 |
| `Gk.liberties` | 0.12 |
| `ladder_exists` + `ladder_favor` | 0.16 |
| `ko_active` | 0.08 |
| `Gk.status` | 0.12 |
| `capture_race_exists` + `capture_race_favor` | 0.12 |
| `Gk.size` | 0.06 |
| `Gk.connected_to_edge` | 0.06 |
| `focus_move_pattern` | 0.08 |
| `focus_shape_type` | 0.04 |

**Tier 3 Bucket Accuracy Weights**
| Field | Weight |
| :--- | :--- |
| `score_uncertainty` | 0.20 |
| `position_sharpness` | 0.20 |
| `focus_urgency` | 0.20 |
| `focus_ownership_uncertainty` | 0.20 |
| `game_phase` | 0.10 |
| `utility_gap` | 0.10 |

**Counterfactual Targets**
| Field | Perturbation |
| :--- | :--- |
| `G1.in_atari` | flip bool; adjust liberties |
| `ladder_exists` | flip bool; if false, set `ladder_favor = none` |
| `focus_ownership_uncertainty` | `high` ↔ `low`; `medium` unchanged |
| `position_sharpness` | `sharp` ↔ `quiet`; `normal` unchanged |
| `utility_gap` | `large` ↔ `small`; `medium` unchanged |

**Sequence Length Budgets**
| Pass | v0.2 budget | v0.3 budget | Reason |
| :--- | :--- | :--- | :--- |
| Pass 1 max tokens | 768 | 1024 | Prose interleaved with tags adds ~250 tokens |
| Pass 2 max tokens | 256 | 256 | Unchanged |

**Key Hyperparameters**
| Parameter | v0.3 value |
| :--- | :--- |
| Max Pass 1 tokens | 1024 |
| Max Pass 2 tokens | 256 |
| Group size G | 6 |
| Learning rate | 5e-6 |
| KL coefficient β | 0.04 |
| Reference model | Qwen3-8B-Instruct frozen |
| Training steps | 500–1000 |
| Warmup steps | 50 |
| Counterfactual fields per rollout | 5 |

**Data Pipeline Source Fractions**
| Source | Fraction |
| :--- | :--- |
| Professional game corpora | 40% |
| Synthetic critical positions (KataGo-filtered) | 40% |
| LoGos training positions with low explanation fidelity | 20% |

**Estimated Training Timeline (10k positions, 1 epoch)**
| Phase | Estimated time |
| :--- | :--- |
| KataGo pre-analysis (800 visits) | ~2 hours |
| boardmatcher pre-computation | ~30 minutes |
| SFT cold-start (1000 demos, 2 epochs) | ~2–3 hours |
| RL training (500 steps, G=6, 5 cf, 1024 tokens) | ~20–28 hours |
| Evaluation + human review sample | ~3 hours |
| **Total** | **~28–36 hours** |

### Lists and Enumerations

**Base Model Selection**
*   Recommended: Qwen3-8B (non-thinking mode)
*   Fallback: Qwen2.5-7B-Instruct

**Training Filtering Criteria**
*   Top-3 KataGo moves differ by ≥ 5% utility
*   `scoreStdev` > 15
*   At least one of: atari present, ladder present, capture race, `ownershipStdev` sum over any quadrant > 3

**Pilot Success Criteria**
1.  `in_atari` accuracy > 95%, self-contradiction rate < 5%
2.  `ladder_exists` accuracy > 85%
3.  `focus_move_pattern` match rate > 70% on named positions
4.  Top-1 PV first-move match > 55%
5.  Mean causal delta for `in_atari` > 0.05
6.  Mean causal delta for `position_sharpness` > 0.03
7.  Tier 1 concept accuracy ≥ 5 percentage points above the two-pass no-tool baseline
8.  Human evaluation: ≥ 60% of prose-claim pairs rated "supports the claim"

**Known Risks**
*   Vacuous prose generation
*   Claim tag appearing before reasoning
*   Reward hacking through explanation style
*   Faithfulness plateau
*   Domain knowledge bottleneck
*   Unfaithful verbose CoT
*   Reward hacking at the PV boundary
*   PV divergence at high move count

**Deferred Phase 2 Decisions**
*   Prose-claim consistency reward
*   Prose quality optimization (RLHF/preference learning)
*   `joseki_deviation` claim field
*   Human SL model integration via KataGo `humanPolicy`
*   Open-ended discovered concept slots
*   Full-strength life-and-death solver
*   veRL migration for multi-node scaling