DENS: A Dataset for Multi-class Emotion Analysis

Chen Liu, Muhammad Osama, Anderson de Andrade

Published: 2019, Last Modified: 12 May 2025. EMNLP/IJCNLP (1) 2019. CC BY-SA 4.0

Abstract: We introduce a new dataset for multi-class emotion analysis from long-form narratives in English. The Dataset for Emotions of Narrative Sequences (DENS) was collected from both classic literature available on Project Gutenberg and modern online narratives available on Wattpad, and was annotated using Amazon Mechanical Turk. We provide a number of statistics and baseline benchmarks for the dataset. Of the tested techniques, we find that fine-tuning a pre-trained BERT model achieves the best results, with an average micro-F1 score of 60.4%. Our results show that the dataset provides a novel opportunity in emotion analysis that requires moving beyond existing sentence-level techniques.
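
For readers unfamiliar with the reported metric, the following is a minimal sketch of how a micro-averaged F1 score can be computed, assuming each passage carries exactly one gold label and one predicted label. The function name `micro_f1` and the example labels are hypothetical illustrations, not taken from the dataset or the authors' evaluation code.

```python
def micro_f1(y_true, y_pred):
    """Micro-averaged F1: pool TP/FP/FN over all classes, then compute F1.

    Assumes single-label classification (one gold and one predicted
    label per item); this setup is an assumption for illustration.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    tp = fp = fn = 0
    for gold, pred in zip(y_true, y_pred):
        if gold == pred:
            tp += 1  # correct prediction: true positive for that class
        else:
            fp += 1  # false positive for the predicted class
            fn += 1  # false negative for the gold class

    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Hypothetical labels, for illustration only.
gold = ["joy", "sadness", "anger", "joy", "neutral"]
pred = ["joy", "anger", "anger", "neutral", "neutral"]
score = micro_f1(gold, pred)
print(f"micro-F1 = {score:.3f}")
```

Under the single-label assumption every error counts once as a false positive and once as a false negative, so micro-F1 coincides with plain accuracy. It therefore weights each passage equally rather than each emotion class.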