Abstract: In this paper, we present our submission to the EEG-Music Emotion Recognition Challenge at ICASSP 2025. Our work focused on Task 2, where the objective was to classify the emotional state of subjects in a discrete valence-arousal space while they listen to music. Our proposed solution adopts an ensemble approach, integrating electroencephalography (EEG) signals, raw audio signals, and song features. We incorporated a diverse set of models, including Audio Spectrogram Transformer (AST) [1] and a dedicated EEG model. By combining insights from these modalities, we aimed to capture the interplay between music and emotional responses. We achieved a balanced accuracy of 41.34% on held-out data, improving the baseline by more than 11% and thus 2nd place in the competition.
External IDs:dblp:conf/icassp/PauknerRSESD25
Loading