Adversarial alignment and graph fusion via information bottleneck for multimodal emotion recognition in conversations

Yuntao Shou, Tao Meng, Wei Ai, Fuchen Zhang, Nan Yin, Keqin Li

Published: 2024, Last Modified: 28 Sept 2024. Inf. Fusion 2024. CC BY-SA 4.0

Abstract: Highlights
• A multimodal emotion recognition architecture through adversarial alignment and graph fusion is proposed.
• A cross-modal feature alignment method with adversarial learning is designed to eliminate inter-modal heterogeneity.
• A graph contrastive learning method via information bottleneck is proposed to enhance multimodal semantic association.
• Our method can be applied to other multimodal tasks in a plug-and-play manner, e.g., humor detection.