### Code Blocks

**Oracle Implementation Snippet**
```python
def oracle_verify(func_code: str, func_name: str) -> dict:
 """
 Returns all 12 claim types by:
 - Parsing AST for loop structure, mutations, operations
 - Running test cases for correctness, edge cases
 - Computing complexity from loop nesting depth + recursion
 - Pattern matching for algorithm class (keywords, structure)
 """
 return {
 'time_complexity': 'O_n2',
 'space_complexity': 'O_1',
 'best_case_time': 'O_n2',
 'algorithm_class': 'sorting',
 'loop_structure': 'nested_2_level',
 'key_operation': 'comparison',
 'access_pattern': 'sequential',
 'auxiliary_structures': 'temp_variable',
 'mutates_input': True,
 'correctness_status': 'fully_correct',
 'handles_empty_input': True,
 'handles_duplicates': 'preserves_all',
 }
```

**Dataset Structure (dataset_rich.py)**
```python
@dataclass
class RichExample:
 idx: int
 code_snippet: str
 true_explanation: str # 3-5 sentences
 mismatched_explanation: str
 # 12 claim types as separate fields
 time_complexity: str
 space_complexity: str
 best_case_time: str
 algorithm_class: str
 loop_structure: str
 key_operation: str
 access_pattern: str
 auxiliary_structures: str
 mutates_input: bool
 correctness_status: str
 handles_empty_input: bool
 handles_duplicates: str
 template_name: str

def build_rich_dataset(n=2500, seed=42) -> List[RichExample]:
 """Generate n examples with programmatic oracle."""
 ...
```

**Tokenizer Extensions**
```python
CLAIM_TOKENS = [
 'time_complexity=O_1', 'time_complexity=O_n', 'time_complexity=O_n2', ...,
 'algorithm_class=sorting', 'algorithm_class=searching', ...,
 'loop_structure=single_pass', 'loop_structure=nested_2_level', ...,
 # ... all claim value combinations
]
```

**Target Sequence Format**
```text
<bos> {code} <sep> {explanation}
<claim>time_complexity=O_n2</claim>
<claim>space_complexity=O_1</claim>
<claim>best_case_time=O_n2</claim>
<claim>algorithm_class=sorting</claim>
<claim>loop_structure=nested_2_level</claim>
<claim>key_operation=comparison</claim>
<claim>access_pattern=sequential</claim>
<claim>auxiliary_structures=temp_variable</claim>
<claim>mutates_input=true</claim>
<claim>correctness_status=fully_correct</claim>
<claim>handles_empty_input=true</claim>
<claim>handles_duplicates=preserves_all</claim>
<eos>
```

**Model Architecture Changes (model.py)**
```python
# In model.py
self.time_head = nn.Linear(cfg.d_model, 6) # 6 time complexity bins
self.space_head = nn.Linear(cfg.d_model, 4) # 4 space bins
self.bestcase_head = nn.Linear(cfg.d_model, 6)
self.algclass_head = nn.Linear(cfg.d_model, 7) # 7 algorithm classes
self.loopstruct_head = nn.Linear(cfg.d_model, 6)
self.keyop_head = nn.Linear(cfg.d_model, 7)
self.accesspat_head = nn.Linear(cfg.d_model, 5)
self.auxstruct_head = nn.Linear(cfg.d_model, 7)
self.mutates_head = nn.Linear(cfg.d_model, 2)
self.correctness_head = nn.Linear(cfg.d_model, 5)
self.emptyinput_head = nn.Linear(cfg.d_model, 3)
self.duplicates_head = nn.Linear(cfg.d_model, 4)

# Consistency loss averages over all 12 heads
```

**Execution Commands (run_rich_experiment.py)**
```bash
python run_rich_experiment.py --full # 20 epochs, 2500 examples
python run_rich_experiment.py --smoke # 3 epochs, 300 examples for testing
```

**Bubble Sort Explanation Ground-Truth Example**
```text
"This function implements bubble sort by repeatedly comparing adjacent elements and swapping them if out of order. It uses nested loops to make multiple passes over the list, resulting in O(n²) time complexity in all cases. The algorithm modifies the input list in-place using only constant auxiliary space for the swap operation. It correctly handles empty lists and preserves duplicate values while maintaining stability."
```

**Garbled Output Example (Main Variant)**
```text
"tatit p er nation ch esap er sarsandcondites in gparsiteandsitev in g in gleandicantion in s.
Time O n space O1. Best-case O1 when stelement. Keyoperation is ss..."
```

**Adversarial Minimal Pairs Snippet**
```python
# Template A: Correct binary search
def search(arr, target):
 lo, hi = 0, len(arr)
 while lo < hi:
 mid = (lo + hi) // 2
 if arr[mid] < target:
 lo = mid + 1
 else:
 hi = mid
 return lo

# Template B: Off-by-one bug (< vs <=)
def search(arr, target):
 lo, hi = 0, len(arr)
 while lo <= hi: # BUG
 mid = (lo + hi) // 2
 if arr[mid] < target:
 lo = mid + 1
 else:
 hi = mid
 return lo
```

**Pretrained Model Variants List**
```python
Variants to run:
1. pretrained_frozen # Zero-shot, no training
2. pretrained_lm_only # Fine-tune on mismatched pairs (corruption test)
3. pretrained_consistency # LM + consistency loss (main hypothesis)
4. pretrained_claim_only # Consistency on claims, not explanations
5. pretrained_expl_only_verifier # Claims verified from text only
6. pretrained_surface_bottleneck # Consistency from token logits
```

### Filenames Mentioned
*   `dataset_rich.py`
*   `model_rich.py`
*   `trainer_rich.py`
*   `run_rich_experiment.py`
*   `manual_review_scores_rich_full.csv`
*   `manual_review_annotations_rich_full.csv`
*   `report.pplx.md`
*   `qualitative_side_by_side_rich.pplx.md`
*   `manual_qualitative_review_rich_full.pplx.md`
*   `dataset.py`
*   `model.py`

### Data Tables and Metric Summaries

**Rich Results Summary Table**
| Variant | Mean_Coupling | Swap_Influence | BLEU_1 | ROUGE_L | Manual_Usable | Manual_Correct |
| :--- | :--- | :--- | :--- | :--- | :--- | :--- |
| Consistency Loss (Main) | 0.986 | 0.977777777777778 | 0.0610720361509835 | 0.0611787422934043 | 0 | 0 |
| No Consistency Loss | 0.2863333333333333 | -0.0333333333333333 | 0.0686355661881977 | 0.0617427855224923 | 0 | 0 |
| Claim-Only Pooling | 0.5946666666666666 | 0.1222222222222222 | 0.0554189261031366 | 0.0478925556264365 | 0 | 0 |
| Random Label Consistency | 0.562 | -0.0222222222222222 | 0.0593285486443381 | 0.0526238305112604 | 0 | 0 |
| No Claim↔Claim Attn (V2) | 0.9858333333333336 | 0.977777777777778 | 0.0614013822434875 | 0.0594859466458738 | 0 | 0 |
| Claims from Expl Only (V2) | 0.986 | 0.977777777777778 | 0.0557886762360446 | 0.0569592942588268 | 0 | 0 |
| Surface Bottleneck (V2) | 0.5501666666666667 | -0.0333333333333333 | 0.0687671451355661 | 0.0646380236177304 | 0 | 0 |
| Surface Btlnk No Expl LM (V2) | 0.547 | -0.0222222222222222 | 0.0 | 0.0 | 0 | 0 |

**Structural Parallelism: Go vs. Code (Complexity)**
| Dimension | Go (KataGo) | Code (Complexity) |
| :--- | :--- | :--- |
| Generated text | Move commentary | Algorithm explanation |
| Verifiable claim | Win probability bin | Complexity class |
| Oracle | KataGo analysis | Static analyzer |
| Claim format | Scalar/categorical | Categorical (O-notation) |

**Structural Parallel: Go vs. Code (Explanations)**
| Aspect | Go Domain | Code Domain |
| :--- | :--- | :--- |
| Input | Board position | Code snippet |
| Generated output | Natural language commentary | Natural language explanation |
| Interleaved claims | Win-prob, score-lead, urgency | Complexity, correctness, behavior |
| Oracle | KataGo analysis | Static analyzer / test suite |

**Comparison Questions for Pretrained Models**
| Question | Comparison |
| :--- | :--- |
| Does fine-tuning corrupt explanations? | `frozen` vs `lm_only` |
| Does consistency prevent corruption? | `lm_only` vs `consistency` |
| Is the mechanism representation or text? | `consistency` vs `expl_only_verifier` |
| Do claims need to be verbalized? | `consistency` vs `expl_only_verifier` |

### Statistics and Specific Metrics

**FEVER Proxy Results**
*   Accuracy: 100%
*   Faithfulness: 100%
*   Swap influence: 0%

**Rich-Claim Experiment Summary Metrics**
*   Mean coupling (Main Variant): 98.6%
*   Swap influence (Main Variant): 97.8%
*   Mean coupling (No Consistency Baseline): 28.6%
*   Random labels coupling: 56.2%
*   Claim-only pooling coupling: 59.5%
*   Surface bottleneck coupling: 55%
*   Manual review usable explanations: 0/20
*   BLEU-1 (Main Variant): ~0.061
*   Claim emission accuracy: 2.2%

**Target Validation Metrics (Target Goals)**
*   Classifier Accuracy: >90% (KataGo parallel)
*   Swap-following: >95%
*   Mean Coupling (Rich Experiment): >85%
*   BLEU-1 / ROUGE-L: 0.30-0.50

**Data and Scale Requirements**
*   Rich dataset training: 2,500 examples
*   Rich dataset validation: 500 examples
*   Algorithm complexity dataset: 2,000-5,000 algorithm implementations
*   Initial coupling signal requirement: 2,000-3,000 examples
*   Publication-quality requirement: 5,000-10,000 examples
*   Training duration: 10-20 epochs (1-2 days on single GPU)
*   Model parameter range: 1-3B or 7B (CodeLlama 7B, StarCoder 1B, CodeGen-small 350M)

### Lists of Specifications

**Rich Claim Ontology (12 Types)**
1.  **Complexity Claims**:
    *   `time_complexity`: {O_1, O_log_n, O_n, O_n_log_n, O_n2, O_2n}
    *   `space_complexity`: {O_1, O_log_n, O_n, O_n2}
    *   `best_case_time`: {O_1, O_log_n, O_n, O_n_log_n, O_n2, same_as_worst}
2.  **Algorithmic Structure Claims**:
    *   `algorithm_class`: {sorting, searching, traversal, dynamic_programming, greedy, math_computation, string_processing}
    *   `loop_structure`: {no_loops, single_pass, nested_2_level, nested_3_level, recursive, iterative_with_recursion}
    *   `key_operation`: {comparison, arithmetic, hash_lookup, list_append, string_concat, swap, assignment}
    *   `access_pattern`: {sequential, random_access, sliding_window, two_pointer, divide_conquer}
3.  **Data Structure Claims**:
    *   `auxiliary_structures`: {none, temp_variable, array, hash_map, set, stack, recursion_stack}
    *   `mutates_input`: {true, false}
4.  **Correctness Claims**:
    *   `correctness_status`: {fully_correct, off_by_one, wrong_condition, missing_edge_case, infinite_loop_risk}
    *   `handles_empty_input`: {true, false, crashes}
    *   `handles_duplicates`: {preserves_all, removes_duplicates, undefined_behavior, not_applicable}

**Code Generation Specs for Dataset**
*   Length: 5-20 lines
*   Bug rate: ~20%
*   Edge cases: empty input, single element, duplicates, negative numbers, None values.

### Downloadable Artifacts
*Specific downloadable links are not present on the page; however, the following files are cited as part of the experiment output structure:*
*   `manual_review_scores_rich_full.csv`
*   `manual_review_annotations_rich_full.csv`
*   `rich_results_summary` (implied CSV/Table)