# Building a KataGo-Grounded Transformer for Natural Language Go Analysis

## Executive Summary

Building a model that generates natural language explanations of Go positions — backed by genuine positional understanding rather than LLM hallucination — requires a multi-stage system. The core design challenge is that KataGo provides numerical ground truth (win rates, ownership maps, score leads, principal variations) with no accompanying natural language rationale. A transformer must bridge this gap by learning to translate KataGo's structured outputs into accurate, strategy-grounded prose. The most complete prior art for this is the **Mastermind-Go** pipeline (Shanghai AI Lab / CUHK, 2025), which fine-tuned LLaMA-2-7B using a four-stage curriculum drawing from KataGo self-play, KataGo analysis JSON, and Go book commentary. This guide details how to build and improve on that approach.[^1][^2][^3]

***

## 1. Architecture Overview

The system has three main layers:

1. **Board Encoder** — Converts the 19×19 Go position into a structured representation the transformer can process spatially and sequentially.
2. **KataGo Analysis Layer** — Runs the KataGo analysis engine on the position (or a set of candidate moves) to generate verified numerical annotations.
3. **Language Decoder (Transformer)** — A fine-tuned LLM that ingests the board encoding + KataGo annotations and generates natural language explanation.

This is architecturally analogous to an **image-captioning encoder-decoder**, where the CNN/ViT encoder is replaced by KataGo and a board serializer, and the RNN/transformer decoder generates the explanation. The key difference from generic LLM tool use is that the KataGo annotations are embedded as structured training input tokens, not just run-time prompts — meaning the model learns *how to reason from* KataGo outputs, not just how to quote them.[^4]

***

## 2. The Board Encoder

### 2.1 Text Serialization (Required for LLM Input)

The 19×19 board must be flattened into a token-efficient 1D sequence without losing spatial information. Mastermind-Go uses a proven convention:[^2][^1]

- `#` = black stone
- `o` = white stone
- `•` = empty intersection
- Coordinates: columns A–T (left to right), rows 1–19 (bottom to top); bottom-left corner = A1
- Recent moves are tagged with move index: `#(3)` = Black's 3rd stone, `o(1)` = White's 1st stone

The full board serializes to coordinate-symbol pairs, e.g., `A1: • B1: #(3) C1: o ...`, encoding both current position and recent history. LLMs process this as a structured string; the model must implicitly learn 2D spatial relationships from the 1D sequence. This is the hardest part of Go encoding — the spatial bottleneck.[^1]

### 2.2 Extended Feature Planes (For Deeper Encoding)

KataGo's own neural net uses **two input tensors** combining binary feature planes for:[^5][^6]
- Stone positions per color
- Liberty counts (one-hot encoded per count — not a single integer, to allow the net to treat "2 liberties" and "3 liberties" as categorically distinct)[^7]
- Move history (last N positions)
- Ko point
- Komi, scoring rules, board size

While KataGo uses these as CNN inputs, you can augment your text encoding by including derived features in the prompt: liberty counts per group, group sizes, and whether stones are in atari. These add critical tactical context that the bare board symbols do not convey.

### 2.3 Alternative: Hybrid CNN + Transformer Encoder

For a more powerful learned encoder, consider pre-encoding the board with a **convolutional trunk** (as in KataGo's architecture: residual blocks with pre-activation) and using the resulting feature map as a prefix to the language decoder. This is the approach taken in ResTNet, which interleaves residual and Transformer blocks to handle both local tactical and global strategic information, increasing win rates from 53.6% to 60.9% in 19×19 Go. You would freeze or lightly fine-tune KataGo's own neural net as the spatial encoder and project its internal activations into the LLM's embedding space — analogous to how LLaVA uses CLIP as a frozen visual encoder and projects features into LLaMA's embedding space.[^8][^9][^10]

***

## 3. KataGo Analysis Annotations

### 3.1 What KataGo Outputs (Verified Ground Truth)

KataGo's parallel analysis engine accepts positions via a JSON protocol and returns per-move and per-position statistics. The key fields to extract and include in your training data are:[^3]

| Field | Meaning | Use in Explanation |
|---|---|---|
| `rootInfo.winrate` | Win probability [^11] for current player | "Black wins in 87% of continuations" |
| `rootInfo.scoreLead` | Predicted point lead | "Black leads by ~5.8 points" |
| `moveInfos.move` | Best move | "KataGo's top choice is Q4" |
| `moveInfos[].order` | Rank of each candidate | Compare top-3 candidates |
| `moveInfos[].visits` | Search depth per move | Confidence signal |
| `moveInfos[].pv` | Principal variation | "Expected continuation: Q4, R5, P5…" |
| `moveInfos[].prior` | Raw policy prior | How "obvious" the move is |
| `ownership[]` | 361-length array, -1 to +1 | Territory map |
| `ownershipStdev[]` | Uncertainty per intersection | Contested regions |

These numbers are **directly verifiable against KataGo** — the model should learn to anchor its explanations to them, not generate independent estimates.[^12][^3]

### 3.2 Deriving Conceptual Features from KataGo Output

You can extract additional derived features to enrich training data before it ever reaches the LLM:

- **Ownership map → territory regions**: Cluster ownership values into "clearly Black" (>0.8), "contested" (−0.2 to 0.2), "clearly White" (<−0.8). Label these as text: "Lower-left is firmly Black's territory; upper-right is contested."
- **Win rate delta**: Compute `moveInfos.winrate − moveInfos[^1].winrate` to measure how critical the top move is. A large delta = "only move."
- **PV overlap**: If multiple candidate moves share the same 5-move PV continuation, they are strategically equivalent.
- **Score drift across candidate moves**: If different candidate moves yield very different `scoreLead` values, the position is tactically urgent.

These derived features transform raw KataGo data into intermediate semantic concepts that the LLM can learn to articulate.[^2][^1]

***

## 4. Training Dataset Construction

This is the most critical and effort-intensive step. The dataset must be built in **four curriculum layers**, progressively from mechanics to strategy:

### Task 1: State-Transition Prediction (Rule Mastery)

**Format**: `(board_state, move) → next_board_state`

Generate 150,000+ examples from KataGo self-play games. At each turn, show the board and the move played, and ask the model to predict the resulting board state (including captures). This teaches the model the fundamental rules — capturing, ko, group connectivity — without which no explanation is valid.[^1]

**Why it matters**: Mastermind-Go found near-perfect accuracy (99.44% single-task, 96.08% multi-task) on this task after fine-tuning. The model cannot reason about life-and-death, ladders, or snapbacks without internalizing capture rules.[^1]

### Task 2: KataGo Position Analysis (Quantitative Understanding)

**Format**: `(board_state, candidate_moves, komi) → {ownership_map, score_lead, win_probability}`

For each board state in your self-play corpus:
1. Submit position to KataGo analysis engine with `includeOwnership: true`
2. Record `ownership`, `rootInfo.scoreLead`, `rootInfo.winrate`
3. Simulate each of the top-3 candidate moves and record post-move evaluations
4. Chain these into a CoT template: predict ownership → count territories → compute score difference → output winrate

Mastermind-Go used 138,693 samples from 36 KataGo trajectories, achieving Score MAE of 1.74 points and Win Rate MAE of 4.49%. Critically, integrate a **code tool ("Count" function)** that lets the model call a territory-counting subroutine rather than counting intersections manually — this dramatically reduces arithmetic errors.[^1]

### Task 3: Natural Language Explanation from Go Books (Conceptual Grounding)

**Format**: `(board_state) → [strategic explanation in natural language]`

This is where the model learns to express concepts like "influence," "thickness," "aji," "sente/gote," "moyo," "overconcentration." The source data should be:[^2][^1]

- **Go textbooks and commentary**: Kifu databases with professional commentaries (e.g., from Go review books or annotated pro game records)
- **AI game review databases**: Comments from platforms like OGS, KGS, or Tygem where players receive reviews
- **Structured commentary pairs**: Board diagram + explanation extracted from tsumego collections (life-and-death problems with solution text)

Mastermind-Go extracted state-explanation pairs from Go books, yielding only 1,503 samples — enough for basic concept coverage but insufficient for fluent strategic explanation. A production system should target **50,000+ annotated positions** with human-written or GPT-assisted commentary that is grounded in KataGo's evaluation.[^1]

**Important**: Annotations should be **conditionally generated** given KataGo's analysis. If KataGo says White leads by 8 points and Black's last move dropped win rate by 12%, the corresponding annotation should reference those facts explicitly. Never train on commentary that contradicts the ground-truth KataGo evaluation.

### Task 4: Full Integrated Decision (End-to-End Reasoning)

**Format**: `(board_state) → [step-by-step reasoning → best move → explanation]`

Combine the above three tasks into a chain-of-thought template:
1. Predict ownership (Task 2 output)
2. Assess territorial balance (derived from ownership)
3. Identify the critical area (from ownership uncertainty / contested regions)
4. State KataGo's top move and score impact
5. Generate strategic rationale (Task 3 style)
6. Declare the recommended move

This is the "full product" query format. Chain-of-thought significantly improves accuracy — Mastermind-Dou (same framework) saw action accuracy rise from 78% to 90% when CoT was added, with thought-process similarity of 98%.[^1]

***

## 5. Model Selection and Fine-Tuning

### 5.1 Base Model Choice

Any capable open-source transformer works as a base. Mastermind-Go used **LLaMA-2-7B**. In 2026, better starting points include:[^1]

- **LLaMA-3.1-8B or 70B** — Strong reasoning, longer context
- **Mistral-7B or Mixtral-8x7B** — Efficient with long structured inputs
- **Qwen2.5-7B** — Particularly strong on structured data and spatial reasoning

Larger models benefit more from the game-data fine-tuning; Mastermind found that Gemma-7B outperformed Gemma-2B by 10 percentage points on action accuracy after the same fine-tuning.[^1]

### 5.2 Supervised Fine-Tuning (SFT)

Use standard SFT with **LoRA** for efficiency:

```python
# Key hyperparameters (from Mastermind-Go):
# LoRA: r=32, alpha=64
# Learning rate: 5e-5 with cosine decay
# Gradient clipping: norm=1.0
# Dropout: 0.1
# Hardware: 8x A100 GPUs
# Optimizer: AdamW
```

Training loss is standard next-token cross-entropy over the answer tokens only (question tokens masked):[^1]

\[L_\theta = \frac{1}{N} \sum_{i=1}^{N} \log_\theta(Y_i \mid X_i)\]

Mix Task 1, 2, 3, and 4 data with **calibrated proportions** — not equal splits. Task 2 (KataGo analysis) should be most heavily weighted since it's the quantitative grounding layer. A reasonable starting ratio is 40:35:15:10 for Tasks 1:2:3:4, then adjust based on validation metrics.

### 5.3 Alignment: DPO over RLHF

After SFT, use **Direct Preference Optimization (DPO)** rather than full RLHF to align the explanation quality with human Go knowledge. DPO requires only two models in memory (policy + reference) instead of four, making it feasible without frontier-scale compute.[^13][^14][^15]

**Preference data construction**:
- For each position, generate 4–6 candidate explanations from the SFT model
- Have dan-level Go players rank the explanations on: strategic accuracy, conceptual depth, grounding in KataGo data, readability
- Feed the ranked pairs into DPO training

**Crucially**: Ensure that preferred explanations are ones that *accurately reflect KataGo's evaluation*. An explanation that is fluent but contradicts the win-rate data should always be ranked lower, even if it sounds better to a casual reader.[^16]

***

## 6. Verifiability Mechanisms

Deep understanding without verifiability is just hallucination. Build explicit grounding constraints:

### 6.1 Citation-Style Grounding

Train the model to inline-cite KataGo values when making quantitative claims. For example:

> "This move drops Black's win rate from 73% to 61% [KataGo: −12%] and loses approximately 3.4 points [KataGo: scoreLead 8.1 → 4.7]. White now threatens to cut the two black groups apart, as shown in the principal variation: R16, Q17, R15…"

When the model learns to generate in this format, any deviation from KataGo's actual values is immediately auditable.

### 6.2 Structured Output with Validation

Use a structured generation schema (JSON + prose hybrid) where KataGo fields are filled from verified API output, and only the explanation text is LLM-generated:

```json
{
  "winrate": 0.73,         // KataGo API — not generated
  "score_lead": 5.2,       // KataGo API — not generated
  "best_move": "R16",      // KataGo API — not generated
  "ownership_clusters": {  // derived from KataGo — not generated
    "black_secure": ["lower-left corner", "right side"],
    "contested": ["upper-center"],
    "white_secure": ["upper-left corner"]
  },
  "explanation": "..."     // LLM-generated, must reference above
}
```

This approach — structuring outputs and parsing them with strict schemas — is well-established for constraining LLM generation to validated data.[^17]

### 6.3 A Separate Critic/Verifier

Train a small critic model to flag explanations that contradict KataGo data. Feed it `(KataGo_output, explanation)` pairs and classify as "consistent" or "inconsistent." This can be used both during DPO data curation and at inference time as a filter.

***

## 7. Architecture Variants: From Simple to Deep

| Approach | Effort | Explanation Quality | Verifiability |
|---|---|---|---|
| **GPT-5.5 + KataGo MCP (prompt only)** | Low (hours) | Moderate; explanation is improvised | Partial (numbers grounded, concepts not) |
| **SFT on KataGo analysis data (Tasks 1–2)** | Medium (days) | Good on quantitative reasoning | High for evaluation claims |
| **Full Mastermind-Go 4-task SFT** | High (weeks) | Good; concepts from Go books | Moderate for strategic explanations |
| **Full SFT + DPO with dan-level feedback** | Very high (months) | Strong; aligned with expert understanding | High if citation-grounding is trained |
| **Hybrid CNN encoder (frozen KataGo trunk) + LLM decoder** | Research-level | Potentially highest; model "sees" the board | Requires interpretability work |

***

## 8. Known Failure Modes and Mitigations

### Catastrophic Forgetting
Mastermind-Go observed performance drops on date understanding and spatial reasoning in BBH benchmarks after fine-tuning. Mitigation: Mix general instruction data into fine-tuning (5–10% of total tokens) or use continual learning techniques.[^1]

### Ownership Map Counting Errors
LLMs struggle to count intersections in the ownership array reliably. Mastermind-Go solved this with a programmatic "Count" tool function that the LLM calls during generation. This is essential — do not try to make the model count territories natively.[^1]

### Explanation–Evaluation Mismatch
The biggest risk: the model generates strategically plausible text that contradicts KataGo's numerical evaluation. Mitigations include citation-grounded training data, structured output constraints, and the DPO critic described above.[^16][^17]

### KataGo's Own Limitations
KataGo can give overconfident or inconsistent evaluations in endgame positions and known adversarial patterns. The model may learn to faithfully interpret KataGo errors as correct. Mitigation: use high-visit analysis (≥2,000 visits per position) and cross-check with multiple random symmetry evaluations (`rootNumSymmetriesToSample: 8`).[^18][^3]

***

## 9. Implementation Checklist

- [ ] Set up KataGo analysis engine with GPU, configure `analysis_example.cfg`
- [ ] Generate 200–500 KataGo self-play games at two strength levels (optimal + Top-p=0.4)
- [ ] Build board serializer (text: `#`, `o`, `•` notation with move indices)
- [ ] Extract Task 1 data: 150k state-transition examples
- [ ] Extract Task 2 data: 140k position-analysis examples with `includeOwnership: true`
- [ ] Curate Task 3 data: 5k–50k board+commentary pairs from Go books / annotated games
- [ ] Build "Count" tool function for territory counting
- [ ] Implement citation-grounded training template
- [ ] Fine-tune base LLM (≥7B) with LoRA using 4-task curriculum
- [ ] Generate preference pairs for DPO; recruit dan-level annotators
- [ ] Run DPO alignment pass
- [ ] Build structured output validator against KataGo source values
- [ ] Evaluate: Score MAE, Win Rate MAE, explanation Rouge-L, human strategic accuracy rating

---

## References

1. [Empowering LLMs in Decision Games through Algorithmic Data ...](https://arxiv.org/html/2503.13980v1) - Specifically, we fine-tune the LLM to learn the common pattern among expert players, thereby enablin...

2. [Mastermind-Go: LLM for Advanced Go Strategy - Emergent Mind](https://www.emergentmind.com/topics/mastermind-go) - Mastermind-Go is an LLM fine-tuned to master Go strategy through data synthesis, self-play, and MCTS...

3. [KataGo/docs/Analysis_Engine.md at master · lightvector ... - GitHub](https://github.com/lightvector/KataGo/blob/master/docs/Analysis_Engine.md) - KataGo contains an engine that can be used to analyze large numbers of positions in parallel (entire...

4. [Image captioning using encoder-decoder - Harshit Kumar](https://kharshit.github.io/blog/2019/01/11/image-captioning-using-encoder-decoder) - In image captioning, the core idea is to use CNN as encoder and a normal RNN as decoder. This applic...

5. [[PDF] Accelerating Self-Play Learning in Go - arXiv](https://arxiv.org/pdf/1902.10565.pdf)

6. [KataGo/docs/KataGoMethods.md at master - GitHub](https://github.com/lightvector/KataGo/blob/master/docs/KataGoMethods.md) - KataGo was possibly the first superhuman selfplay learning Go bot capable of playing on a wide range...

7. [can someone explain AlphaGo's feature input in detail? - Reddit](https://www.reddit.com/r/baduk/comments/5xsw13/can_someone_explain_alphagos_feature_input_in/) - I'm not an expert on neural networks, but my understanding of these feature planes is they are each ...

8. [Bridging Local and Global Knowledge via Transformer in Board ...](https://arxiv.org/html/2410.05347v2) - In ResTNet, the residual and Transformer blocks process different input features. The residual block...

9. [[PDF] Bridging Local and Global Knowledge via Transformer in Board ...](https://www.ijcai.org/proceedings/2025/0828.pdf) - Experiments show ResTNet improves playing strength, with win rates increasing from 54.6% to. 60.8% i...

10. [How do Multimodal Models Process and Understand Images?](https://weaxsey.org/en/articles/2025-06-15/) - From Vision Transformers to image-text alignment, exploring the core technical principles and implem...

11. [OpenAI GPT-5.5 Released: What It Does, Who Gets It, and ... - AI Tools](https://aitoolsrecap.com/Blog/openai-gpt-5-5-release-chatgpt-codex-2026) - OpenAI released GPT-5.5 on April 23, 2026 for Plus, Pro, Business and Enterprise. API coming soon at...

12. [KataGo MCP Server - LobeHub](https://lobehub.com/mcp/dmmcquay-katago-mcp) - An MCP server that provides KataGo analysis capabilities to AI assistants like Claude. Requires exte...

13. [RLHF vs DPO in LLM fine-tuning: 60+ patent analysis | PatSnap](https://www.patsnap.com/resources/blog/articles/rlhf-vs-dpo-in-llm-fine-tuning-60-patent-analysis-2/) - How does RLHF differ from DPO in LLM fine-tuning? A patent-backed technical comparison of architectu...

14. [RLHF Explained: How Human Feedback Trains AI Models in 2026](https://decodethefuture.org/en/rlhf-explained/) - RLHF uses human preferences to align LLMs. Learn the 3-stage pipeline, PPO math, reward model code, ...

15. [RLHF without RL - Direct Preference Optimization](https://iclr-blogposts.github.io/2024/blog/rlhf-without-rl/) - We discuss the RL part of RLHF and its recent displacement by direct preference optimization (DPO). ...

16. [Data Labeling for LLMs: Annotation Methods, Quality & Governance](https://atlan.com/know/data-labeling-best-practices-llms/) - Learn data labeling best practices for LLMs including hybrid annotation workflows, RLHF alignment, q...

17. [LLM Output Parsing and Structured Generation Guide - Tetrate](https://tetrate.io/learn/ai/llm-output-parsing-structured-generation) - Learn about llm output parsing and structured generation guide. Comprehensive guide for developers a...

18. [[PDF] Analysing KATAGO: A Comparative Evaluation Against Perfect Play ...](http://webdocs.cs.ualberta.ca/~mmueller/ps/2024/2024-CG-Husna-Analysing-KataGo.pdf) - KataGo is the strongest open-source Go program, and is very widely used as a study tool in the Go co...

