Abstract: Highlights•A cross-lingual Speech Emotion Recognition (SER) framework spanning five languages.•A semi-supervised learning based cross-lingual SER method for emotion categorization.•Two different approaches for generating pseudo-labels are investigated.•We demonstrate that a Transformer-based outperforms over a CNN-based backbone.
Loading