| variant | gen_claim_acc smoke | gen_claim_acc scaled | Δ | cls_claim_acc (rationale_pool) smoke | cls_claim_acc (rationale_pool) scaled | Δ | cfact_gen_follows_swap smoke | cfact_gen_follows_swap scaled | Δ | cfact_gen_follows_orig smoke | cfact_gen_follows_orig scaled | Δ | cfact_cls_follows_swap smoke | cfact_cls_follows_swap scaled | Δ | cfact_cls_follows_orig smoke | cfact_cls_follows_orig scaled | Δ | shuffled_gen_acc smoke | shuffled_gen_acc scaled | Δ | shuffled_cls_acc smoke | shuffled_cls_acc scaled | Δ | final_lm_loss smoke | final_lm_loss scaled | Δ |
| --- | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: |
| no_consistency_loss | 0.7500 | 1.0000 | +0.2500 | 0.0391 | 0.0879 | +0.0488 | 0.6719 | 1.0000 | +0.3281 | 0.0312 | 0.0000 | -0.0312 | 0.0312 | 0.0781 | +0.0469 | 0.1719 | 0.2344 | +0.0625 | 0.1016 | 0.1289 | +0.0273 | 0.1172 | 0.1641 | +0.0469 | 3.4395 | 0.7381 | -2.7014 |
| rationale_only | 0.6328 | 1.0000 | +0.3672 | 1.0000 | 1.0000 | +0.0000 | 0.5469 | 1.0000 | +0.4531 | 0.0938 | 0.0000 | -0.0938 | 1.0000 | 1.0000 | +0.0000 | 0.0000 | 0.0000 | +0.0000 | 0.0781 | 0.1289 | +0.0508 | 0.1016 | 0.1289 | +0.0273 | 3.5457 | 0.7386 | -2.8071 |
| full_sequence | 0.9375 | 1.0000 | +0.0625 | 1.0000 | 1.0000 | +0.0000 | 0.9375 | 1.0000 | +0.0625 | 0.0156 | 0.0000 | -0.0156 | 1.0000 | 1.0000 | +0.0000 | 0.0000 | 0.0000 | +0.0000 | 0.0938 | 0.1289 | +0.0351 | 0.1016 | 0.1289 | +0.0273 | 3.5428 | 0.7381 | -2.8047 |
| earlier_token_only | 0.6172 | 1.0000 | +0.3828 | 1.0000 | 1.0000 | +0.0000 | 0.5312 | 1.0000 | +0.4688 | 0.1094 | 0.0000 | -0.1094 | 1.0000 | 1.0000 | +0.0000 | 0.0000 | 0.0000 | +0.0000 | 0.0859 | 0.1289 | +0.0430 | 0.1016 | 0.1289 | +0.0273 | 3.5301 | 0.7388 | -2.7913 |
