[GPU OK] CUDA available: 1 device(s). Primary: NVIDIA A10

[INFO] Starting generated-rationale scalar-verifier experiment...
 num_train: 2048
 num_eval: 512
 epochs: 20
 batch_size: 64
 lr: 0.0003
 model: d_model=128, layers=2, heads=4, d_ff=256
 consistency_weight: 0.5
 output_csv: generated_rationale_scalar_20260429.csv
 smoke_test: False

            variant  token_acc  claim_bin_acc  scalar_mse  cfact_cls_follows_swap  cfact_cls_follows_orig  cfact_scalar_mse_to_swap
            lm_only   0.823134       0.000000    0.128352                0.031250                0.117188                  0.102628
no_consistency_loss   0.819661       0.041016    0.000136                0.160156                0.035156                  0.201818
     rationale_only   0.819878       1.000000    0.098205                0.000000                1.000000                  0.104708
   full_consistency   0.816623       1.000000    0.000178                0.000000                1.000000                  0.171643
 random_consistency   0.819010       0.164062    0.000288                0.066406                0.140625                  0.213419
Saved to generated_rationale_scalar_20260429.csv and generated_rationale_scalar_20260429.md

[DONE]
