Highlights
•Construct modality-shared and modality-specific encoders that effectively learn shared and specific feature representations of modalities.
•Propose an end-to-end multimodal representation and fusion method for multimodal intent recognition.
•Propose an adaptive multimodal fusion method based on an attention-based gated neural network, which can distinguish the contributions of different modalities and reduce possible noise (see the sketch after this list).
•Experimental results show that the model outperforms state-of-the-art models on multiple evaluation metrics.
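The highlights do not spell out the fusion mechanism, so the following is only a minimal sketch of what an attention-based gated fusion over modality features might look like; the module name AttentionGatedFusion, the gating-network shape, the 256-dimensional features, and the three modalities (text, video, audio) are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch: attention-gated fusion of modality embeddings (not the paper's code).
import torch
import torch.nn as nn


class AttentionGatedFusion(nn.Module):
    """Weight each modality by a learned attention gate before fusing.

    A small gating network scores every modality embedding; the
    softmax-normalized scores weight the modalities prior to summation,
    so less informative or noisier modalities can receive smaller weights.
    """

    def __init__(self, dim: int):
        super().__init__()
        self.gate = nn.Sequential(
            nn.Linear(dim, dim // 2),
            nn.Tanh(),
            nn.Linear(dim // 2, 1),
        )

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        # feats: (batch, num_modalities, dim)
        scores = self.gate(feats)                 # (batch, num_modalities, 1)
        weights = torch.softmax(scores, dim=1)    # attention over modalities
        fused = (weights * feats).sum(dim=1)      # (batch, dim)
        return fused


if __name__ == "__main__":
    # Assume text/video/audio features already projected to a shared 256-d space.
    fusion = AttentionGatedFusion(dim=256)
    text, video, audio = (torch.randn(8, 256) for _ in range(3))
    fused = fusion(torch.stack([text, video, audio], dim=1))
    print(fused.shape)  # torch.Size([8, 256])
```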