### Mathematical Formulas and Data Points

| Variable/Parameter | Definition/Formula |
| :--- | :--- |
| Completion Group | $o^{(1)}, o^{(2)}, \dots, o^{(G)}$ |
| Scalar Reward | $r^{(i)} = r(\text{board}, o^{(i)})$ |
| Mean Reward | $\bar{r} = \frac{1}{G}\sum_{i=1}^{G} r^{(i)}$ |
| Standard Deviation of Rewards | $\sigma_r = \text{std}(r^{(1)}, \dots, r^{(G)})$ |
| Group-Normalized Advantage | $A^{(i)} = \frac{r^{(i)} - \bar{r}}{\sigma_r + \epsilon}$ |
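
The formulas in the table can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function name, the default `eps`, and the choice of population standard deviation for $\sigma_r$ are all assumptions, since the table only says `std`.

```python
from statistics import fmean, pstdev


def group_normalized_advantages(rewards, eps=1e-8):
    """Return A^(i) = (r^(i) - mean) / (std + eps) for one completion group.

    Assumption: std is the population standard deviation; the source
    does not say whether population or sample std is intended.
    """
    mean_r = fmean(rewards)      # \bar{r}
    sigma_r = pstdev(rewards)    # \sigma_r (population std, assumed)
    return [(r - mean_r) / (sigma_r + eps) for r in rewards]


# Hypothetical group of G = 4 scalar rewards for the same board.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_normalized_advantages(rewards)

# When every completion earns the same reward, sigma_r is 0 and eps
# keeps the division finite; all advantages are then 0.
flat = group_normalized_advantages([0.5, 0.5, 0.5])
```

Because the mean is subtracted within the group, the advantages sum to roughly zero: completions are scored relative to their siblings, not on an absolute scale.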

### Sequence and Architecture Structures

| Type | Structure/Sequence |
| :--- | :--- |
| Full Sequence | `board description + explanation tokens + [MOVE]` |
| Causal Chain | `... board … explanation …` |
| Move Context | `[prompt (board)] + [explanation]` |
| System Design Flow | `[BOARD AS TEXT] → [EXPLANATION TOKENS] → [MOVE TOKEN]` |
| Safe Bottleneck | `board text → transformer → explanation tokens → transformer → move (LM head)` |
| Broken Bottleneck Path 1 | `board → board encoder → board embedding → move head` |
| Broken Bottleneck Path 2 | `board text → transformer → explanation tokens` |
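
To make the "Move Context" row concrete, the sketch below builds a toy causal mask over a `board + explanation + <MOVE>` sequence and lists which tokens each position can attend to. The token names (`B1`, `E1`, and so on) and the helper `causal_mask` are hypothetical placeholders, and placing the `<MOVE>` marker as the last context position is an assumption about the layout.

```python
def causal_mask(n):
    """Lower-triangular mask: position i may attend to positions j <= i."""
    return [[j <= i for j in range(n)] for i in range(n)]


# Hypothetical tokens standing in for the real board and explanation text.
board = ["B1", "B2", "B3"]
explanation = ["E1", "E2"]
sequence = board + explanation + ["<MOVE>"]

mask = causal_mask(len(sequence))


def visible_at(pos):
    """Tokens that the given position can attend to under the mask."""
    return [tok for tok, ok in zip(sequence, mask[pos]) if ok]


# The last position, where the move would be predicted, sees the whole context.
visible_to_move = visible_at(len(sequence) - 1)

# The first explanation token sees only the board and itself.
visible_to_first_explanation = visible_at(len(board))
```

Under this layout the move position conditions on both the board and the explanation tokens, while no explanation token can look ahead to the move.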

### Code and Token Snippets

*   `move_head(board_embedding)`
*   `move_logits = lm.forward(board_tokens + explanation_tokens)`
*   `<MOVE>` (Special marker)
*   `"Q16"` (Example move token)
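
As a small illustration of how the `<MOVE>` marker and a move token such as `"Q16"` might be used together, the sketch below pulls the move out of a generated completion. The completion format (explanation text, then `<MOVE>`, then a whitespace-delimited move) and the function `extract_move` are assumptions for illustration only; the explanation sentence is invented.

```python
MOVE_MARKER = "<MOVE>"


def extract_move(completion):
    """Return the first whitespace-delimited token after <MOVE>, or None.

    Assumes the move is emitted as plain text right after the marker.
    """
    _, found, tail = completion.partition(MOVE_MARKER)
    if not found:
        return None  # marker missing: no move can be read off
    parts = tail.split()
    return parts[0] if parts else None


# Made-up completion: explanation tokens first, then the marker and the move.
completion = "The upper right corner is open. <MOVE> Q16"
move = extract_move(completion)

# A completion with no marker yields None rather than a guessed move.
missing = extract_move("The upper right corner is open.")
```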

### Specific Terms, Names, and Entities

*   **Models/Frameworks:**
    *   LoGos
    *   KataGo
    *   GRPO (Group Relative Policy Optimization)
    *   Transformer
    *   LM head
*   **Concepts:**
    *   Prefix-locked explanation
    *   Explanation bottleneck
    *   Unfaithful CoT (Chain of Thought)
    *   Causal mask
    *   Residual stream
    *   Group-relative advantages
    *   Implicit process reward
    *   SGF-like string
