Examining LLMs' Uncertainty Expression towards Questions outside Parametric Knowledge

Anonymous

16 Feb 2024 · ACL ARR 2024 February Blind Submission · Readers: Everyone
Abstract: Can large language models (LLMs) express their uncertainty in situations where they lack sufficient parametric knowledge to generate reasonable responses? This work aims to systematically investigate LLMs' behaviors in such situations, emphasizing the trade-off between honesty and helpfulness. To tackle the challenge of precisely determining LLMs' "knowledge gaps", we diagnostically create unanswerable questions containing non-existent concepts or false premises, ensuring these are outside the LLMs' vast training data. By compiling a benchmark, UnknownBench, which consists of both unanswerable and answerable questions, we quantitatively evaluate the LLMs' performance in maintaining honesty while being helpful. Using a model-agnostic unified confidence elicitation approach, we observe that most LLMs fail to consistently refuse or express high uncertainty towards questions outside their parametric knowledge, although instruction fine-tuning and alignment techniques can provide marginal enhancements. Moreover, LLMs' uncertainty expression does not always stay consistent with the perceived confidence of their direct responses. We will release our data and code.
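To make the notion of "confidence elicitation" concrete, here is a minimal sketch of one plausible way to elicit and parse a verbalized confidence score from a model. The abstract does not specify the paper's protocol, so the prompt wording, the 0-100 scale, the parsing regex, and the example question about a non-existent concept are all illustrative assumptions, not the authors' exact method.

```python
import re

# Illustrative only: the paper's actual elicitation prompt and scale are not
# given in this abstract; this template assumes a verbalized 0-100 confidence
# score reported on its own line after the answer.
ELICITATION_TEMPLATE = (
    "Answer the question below. If you do not know the answer, say so.\n"
    "Then state your confidence as an integer from 0 (pure guess) to 100 (certain),\n"
    "on a new line in the form 'Confidence: <number>'.\n\n"
    "Question: {question}"
)


def build_prompt(question: str) -> str:
    """Fill the elicitation template with a (possibly unanswerable) question."""
    return ELICITATION_TEMPLATE.format(question=question)


def parse_confidence(response: str) -> int | None:
    """Extract the verbalized confidence score, or None if the model omitted one."""
    match = re.search(r"Confidence:\s*(\d{1,3})", response)
    if match:
        return min(int(match.group(1)), 100)
    return None


# Hypothetical question containing a non-existent concept, with a hand-written
# model response used purely to demonstrate the parsing step.
prompt = build_prompt("In what year did the physicist Elara Voss discover the triton lattice?")
fake_response = "I'm not aware of a physicist named Elara Voss.\nConfidence: 10"
print(parse_confidence(fake_response))  # -> 10
```

Under this kind of setup, an "honest" model would refuse or report low confidence on the unanswerable questions in UnknownBench while remaining confident and helpful on the answerable ones, which is the trade-off the benchmark quantifies.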
Paper Type: short
Research Area: Interpretability and Analysis of Models for NLP
Contribution Types: NLP engineering experiment, Data resources, Data analysis
Languages Studied: English