To Err Like Human: Affective Bias-Inspired Measures for Visual Emotion Recognition Evaluation

Chenxi Zhao; Jinglei Shi; Liqiang Nie; Jufeng Yang

To Err Like Human: Affective Bias-Inspired Measures for Visual Emotion Recognition Evaluation

Chenxi Zhao, Jinglei Shi, Liqiang Nie, Jufeng Yang

Published: 25 Sept 2024, Last Modified: 14 Nov 2024NeurIPS 2024 posterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: visual emotion recognition, evaluation measure

TL;DR: New metrics inspired by cognitive nuroscience are proposed to evaluate

Abstract: Accuracy is a commonly adopted performance metric in various classification tasks, which measures the proportion of correctly classified samples among all samples. It assumes equal importance for all classes, hence equal severity for misclassifications. However, in the task of emotional classification, due to the psychological similarities between emotions, misclassifying a certain emotion into one class may be more severe than another, e.g., misclassifying 'excitement' as 'anger' apparently is more severe than as 'awe'. Albeit high meaningful for many applications, metrics capable of measuring these cases of misclassifications in visual emotion recognition tasks have yet to be explored. In this paper, based on Mikel's emotion wheel from psychology, we propose a novel approach for evaluating the performance in visual emotion recognition, which takes into account the distance on the emotion wheel between different emotions to mimic the psychological nuances of emotions. Experimental results in semi-supervised learning on emotion recognition and user study have shown that our proposed metrics is more effective than the accuracy to assess the performance and conforms to the cognitive laws of human emotions. The code is available at https://github.com/ZhaoChenxi-nku/ECC.

Supplementary Material: zip

Primary Area: Evaluation (methodology, meta studies, replicability and validity)

Submission Number: 1764

Loading