Large Language Model Relevance Assessors Agree With One Another More Than With Human Assessors

Maik Fröbe, Andrew Parry, Ferdinand Schlatt, Sean MacAvaney, Benno Stein, Martin Potthast, Matthias Hagen

Published: 13 Jul 2025, Last Modified: 21 Jan 2026CrossrefEveryoneRevisionsCC BY-SA 4.0
Loading