Layer of Truth: How Much Poison Is Enough? Illusory-Truth Effects in Continual Pre-training

Published: 22 Sept 2025, Last Modified: 03 Jan 2026WiML @ NeurIPS 2025EveryoneRevisionsBibTeXCC BY 4.0
Keywords: continual pre-training, misinformation, illusory truth effect, belief flipping, linear probes, data governance
Submission Number: 413
Loading