T-FIX: Text-Based Explanations with Features Interpretable to eXperts

Published: 24 Sept 2025, Last Modified: 24 Sept 2025
Venue: NeurIPS 2025 LLM Evaluation Workshop Poster
License: CC BY 4.0
Keywords: interpretability, evaluation, domain alignment, applications
Abstract: As LLMs are deployed in knowledge-intensive settings, professionals need confidence that a model’s reasoning matches domain expertise. Current explanation evaluations focus on plausibility or internal faithfulness, often overlooking alignment with expert intuition. We define expert alignment as a key criterion for evaluating explanations and introduce T-FIX, a benchmark designed to evaluate how well LLM explanations align with expert judgment across seven knowledge-intensive fields.
Submission Number: 156