How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions | OpenReview

How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

Open Webpage

Lorenzo Pacchiardi, Alex James Chan, Sören Mindermann, Ilan Moscovitz, Alexa Y. Pan, Yarin Gal, Owain Evans, Jan Markus Brauner

Published: 2024, Last Modified: 19 Feb 2025ICLR 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Loading