Keywords: Human Emotion Recognition, Machine Learning, Benchmark, Dataset
TL;DR: We introduce EmoNet-Face, a suite of diverse, AI-generated facial emotion datasets with a novel 40-category taxonomy, to improve AI's ability to understand human emotions more accurately and empathetically.
Abstract: Effective human-AI interaction relies on AI's ability to accurately perceive and interpret human emotions. Current benchmarks for vision and vision-language models are severely limited, offering a narrow emotional spectrum that overlooks nuanced states (e.g., bitterness, intoxication) and fails to distinguish subtle differences between related feelings (e.g., shame vs. embarrassment). Existing datasets also often use uncontrolled imagery with occluded faces and lack demographic diversity, risking significant bias. To address these critical gaps, we introduce EmoNet Face, a comprehensive benchmark suite. EmoNet Face features: (1) A novel 40-category emotion taxonomy, meticulously derived from foundational research to capture finer details of human emotional experiences. (2) Three large-scale, AI-generated datasets (EmoNet HQ, Binary, and Big) with explicit, full-face expressions and controlled demographic balance across ethnicity, age, and gender. (3) Rigorous, multi-expert annotations for training and high-fidelity evaluation. (4) We build Empathic Insight Face, a model achieving human-expert-level performance on our benchmark. The publicly released EmoNet Face suite—taxonomy, datasets, and model—provides a robust foundation for developing and evaluating AI systems with a deeper understanding of human emotions.
Croissant File:  zip
Dataset URL: https://huggingface.co/collections/laion/emonet-face-68f66ddb76d42e9ca63e9ae9
Code URL: https://github.com/laion-ai/emonet-face
Primary Area: Datasets & Benchmarks for applications in computer vision
Flagged For Ethics Review: true
Submission Number: 699
Loading