"""Phase 7.12: AUROC and F1 Evaluation for Instruction-Tuned Model.

This phase measures the discriminative power of PVA features, using AUROC
and F1, on the instruction-tuned Gemma model outputs from Phase 7.3, so the
results can be compared directly with the base-model results from Phase 3.8.
"""