A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models

Anonymous

A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models

Anonymous

16 Dec 2023ACL ARR 2023 December Blind SubmissionReaders: Everyone

TL;DR: We propose a novel multi-aspect evaluation framework for counter narratives - informed responses to hate speech contexts - by prompting LLMs to provide evaluation scores and feedback directly based on key aspects inspired by NGO guidelines.

Abstract: Counter narratives --- informed responses to hate speech contexts designed to refute hateful claims and de-escalate encounters --- have emerged as an effective hate speech intervention strategy. While previous work has proposed automatic counter narrative generation methods to aid manual interventions, the evaluation of these approaches remains underdeveloped. Previous automatic metrics for counter narrative evaluation lack alignment with human judgment as they rely on superficial reference comparisons instead of incorporating key aspects of counter narrative quality as evaluation criteria. To address prior evaluation limitations, we propose a novel evaluation framework prompting LLMs to provide scores and feedback for generated counter narrative candidates using 5 defined aspects derived from guidelines from counter narrative specialized NGOs. We found that LLM evaluators achieve strong alignment to human-annotated scores and feedback and outperform alternative metrics, indicating their potential as multi-aspect, reference-free and interpretable evaluators for counter narrative evaluation.

Paper Type: short

Research Area: Resources and Evaluation

Languages Studied: English

0 Replies

Loading