GENUINE: Graph Enhanced Multi-level Uncertainty Estimation for Large Language Models

GENUINE: Graph Enhanced Multi-level Uncertainty Estimation for Large Language Models

ACL ARR 2025 May Submission3672 Authors

19 May 2025 (modified: 29 Jul 2025)ACL ARR 2025 May SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Abstract: Uncertainty estimation is essential for enhancing the reliability of Large Language Models (LLMs), particularly in high-stakes applications. Existing methods often overlook semantic dependencies, relying on token-level probability measures that fail to capture structural relationships within the generated text. We propose GENUINE: Graph ENhanced mUlti-level uncertaINty Estimation for Large Language Models, a structure-aware framework that leverages dependency parse trees and hierarchical graph pooling to refine uncertainty quantification. By incorporating supervised learning, GENUINE effectively models semantic and structural relationships, improving confidence assessments. Extensive experiments across NLP tasks show that GENUINE achieves up to 29% higher AUROC than semantic entropy-based approaches and reduces calibration errors by over 15%, demonstrating the effectiveness of graph-based uncertainty modeling. The code is available at https://anonymous.4open.science/r/GUQ-39E7.

Paper Type: Long

Research Area: Interpretability and Analysis of Models for NLP

Research Area Keywords: Interpretability and Analysis of Models for NLP, Machine Learning for NLP, NLP Applications

Contribution Types: Model analysis & interpretability, Data analysis

Languages Studied: English

Keywords: Uncertainty quantification, Interpretability and Analysis of Models for NLP, Machine Learning for NLP, NLP Applications

Submission Number: 3672

Loading