# Claim Consistency Coupling – Scaled Experiment Results

## Hyperparameters

| Parameter | Value |
|---|---|
| num_train_samples | 5120 |
| num_eval_samples | 512 |
| num_shuffled_samples | 256 |
| num_epochs | 30 |
| num_latent_states | 8 |
| num_rationale_templates | 4 |
| d_model | 64 |
| n_layers | 2 |
| n_heads | 4 |
| d_ff | 128 |
| batch_size | 32 |
| lr | 0.0003 |
| consistency_loss_weight | 0.5 |
| seed | 42 |

## Results

| variant             |   final_lm_loss |   final_cons_loss |   gen_claim_acc |   cls_claim_acc (rationale_pool) |   cfact_gen_follows_swap |   cfact_gen_follows_orig |   cfact_cls_follows_swap |   cfact_cls_follows_orig |   shuffled_gen_acc |   shuffled_cls_acc |
|:--------------------|----------------:|------------------:|----------------:|---------------------------------:|-------------------------:|-------------------------:|-------------------------:|-------------------------:|-------------------:|-------------------:|
| no_consistency_loss |          0.7381 |            0      |               1 |                           0.0879 |                        1 |                        0 |                   0.0781 |                   0.2344 |             0.1289 |             0.1641 |
| rationale_only      |          0.7386 |            0.0007 |               1 |                           1      |                        1 |                        0 |                   1      |                   0      |             0.1289 |             0.1289 |
| full_sequence       |          0.7381 |            0.0011 |               1 |                           1      |                        1 |                        0 |                   1      |                   0      |             0.1289 |             0.1289 |
| earlier_token_only  |          0.7388 |            0.0014 |               1 |                           1      |                        1 |                        0 |                   1      |                   0      |             0.1289 |             0.1289 |

## Column Descriptions

- **variant**: Training objective variant (pooling mode for consistency loss)
- **final_lm_loss**: Cross-entropy language modeling loss at end of training
- **final_cons_loss**: Consistency classification loss at end of training
- **gen_claim_acc**: Greedy generation accuracy – first generated token matches expected claim token
- **cls_claim_acc (rationale_pool)**: Classifier accuracy from mean-pooled rationale hidden states
- **cfact_gen_follows_swap**: Rate that generation follows the swapped (wrong) rationale in counterfactual test
- **cfact_gen_follows_orig**: Rate that generation follows the original claim despite swapped rationale
- **cfact_cls_follows_swap**: Rate that classifier follows swapped rationale
- **cfact_cls_follows_orig**: Rate that classifier follows original claim despite swap
- **shuffled_gen_acc**: Generation accuracy when rationale-claim pairings are shuffled (mismatch control)
- **shuffled_cls_acc**: Classifier accuracy under shuffled-pairing control
