## Table 1

| Metric | Consistency-Trained Variants | Baseline (No Consistency Loss) |
| :--- | :--- | :--- |
| Classifier Accuracy (Rationale Hidden States) | 100% | 3.9% |
| Generation Accuracy (`full_sequence`) | 93.8% | 75% |
| Generation Accuracy (`rationale_only`) | 63% | - |
| Generation Accuracy (`earlier_token_only`) | 62% | - |
| Counterfactual Swap-Following (Classifier) | 100% | 67% |
| Original State Sticking (Classifier) | 0% | 3% |
| Counterfactual Swap-Following (Generation, `full_sequence`) | 93.8% | - |
| Original State Sticking (Generation, `full_sequence`) | 1.6% | - |
| Shuffled-Pairing Control Accuracy | 8-10% | 8-10% |

## Table 2

| Parameter | Value |
| :--- | :--- |
| Training Samples | 512 |
| Evaluation Samples | 128 |
| Counterfactual Samples | 64 |
| Latent States | 8 |
| Training Epochs | 5 |

## Table 3

| Metric | Consistency-Trained Variants | Baseline (No Consistency Loss) |
| :--- | :--- | :--- |
| Classifier Accuracy | 100% | 4.7% |
| Counterfactual Swap-Following (Classifier) | 100% | 6.3% |
| Generation Accuracy (Overall) | 81-100% | 100% |
| Generation Accuracy (`rationale_only`) | 100% | - |
| Generation Accuracy (`earlier_token_only`) | 100% | - |
| Generation Accuracy (`full_sequence`) | 81% | - |
| Counterfactual Swap-Following (Generation, `full_sequence`) | 77% | - |

## Table 4

| Variant | Classifier Accuracy | Counterfactual Swap-Following | Generation Accuracy |
| :--- | :--- | :--- | :--- |
| `full_sequence_pooling` | 84% | 28-48% | - |
| `claim_only_pooling` | 83% | 28-48% | - |
| `evidence_only_pooling` | 44% | 48% | - |
| `no_consistency_loss` (Baseline) | 40% | - | 80% |

## Table 5

| Metric | Result |
| :--- | :--- |
| Hidden-State Intervention Success | 73-89% |
| `claim_only` Control Coupling | 43% |
