LCMA-Net: A light cross-modal attention network for streamer re-identification in live video

Jiacheng Yao, Jing Zhang, Hui Zhang, Li Zhuo

Published: 2024, Last Modified: 06 Nov 2025. Comput. Vis. Image Underst. 2024. CC BY-SA 4.0

Abstract: Highlights
• A light cross-modal attention network is built to complement heterogeneous features.
• A RAMV-Softmax is proposed to improve the effect of Π-Net pre-training.
• A lightweight cross-modal pooling attention is designed to align multimodal features.
• We are the first to establish a streamer re-ID pipeline based on multimodal deep learning.
• We collect a real-world dataset, StreamerReID, for re-ID evaluation.