Target-Oriented Sentiment Classification with Sequential Cross-Modal Semantic Graph

Yufeng Huang, Zhuo Chen, Jiaoyan Chen, Jeff Z. Pan, Zhen Yao, Wen Zhang

Published: 2023, Last Modified: 21 Feb 2025ICANN (4) 2023EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Multi-modal aspect-based sentiment classification (MABSC) is an approach aimed at classifying the sentiment of a target entity mentioned in a sentence using images. However, previous methods failed to account for the fine-grained semantic association between the image and the text, which resulted in limited identification of fine-grained image aspects and opinions. To address these limitations, a new approach called SeqCSG has been proposed in this paper. SeqCSG enhances the encoder-decoder sentiment classification framework using sequential cross-modal semantic graphs. SeqCSG utilizes image captions and scene graphs to extract both global and local fine-grained image information and considers them as elements of the cross-modal semantic graph along with tokens from tweets. The sequential cross-modal semantic graph is represented as a sequence with a multi-modal adjacency matrix indicating relationships between elements. Experimental results have shown that the approach outperforms existing methods and achieves state-of-the-art performance on two standard datasets. Further analysis has demonstrated that the model can implicitly learn the correlation between fine-grained information of the image and the text with the given target.