Learning fine-grained representation with token-level alignment for multimodal sentiment analysis

Published: 01 Jan 2025, Last Modified: 07 Mar 2025, Expert Syst. Appl. 2025, CC BY-SA 4.0
Abstract: Highlights
• Propose fine-grained multimodal fusion network for sentiment analysis.
• Extract fine-grained sentiment representations using fewer denoising tokens.
• Perform token-level alignment to facilitate representation learning and fusion.
• Generate consistent multimodal representations via correlation-aware fusion.