CM-SC: Cross-modal spatial-channel attention network for image captioning

Md. Shamim Hossain, Shamima Aktar, Mohammad Alamgir Hossain, Naijie Gu, Zhangjin Huang

Published: 01 Apr 2025, Last Modified: 12 Nov 2025DisplaysEveryoneRevisionsCC BY-SA 4.0
Abstract: Highlights•Innovative model for multi-modal reasoning.•Cross-Modal Spatial-Channel (CM-SC) attention mechanism.•Effective higher-order interaction capturing.•Scalable attention mechanism and facilitate seamless integration.•Improved computational efficiency and enhanced performance metrics.
Loading