Evaluating emotional and subjective responses in synthetic art-related dialogues: A multi-stage framework with large language models
Abstract: Highlights•Large Language Models (LLM) can generate and evaluate novel dialogues.•A combination of objective metrics with LLM self-reported metrics helps to assess quality.•The proposed framework allows the generation and evaluation of synthetic dialogues.•Framework’s use case: emotional and subjective characteristics of chatbots (anthropic).
Loading