Efficient spatio-temporal modeling and text-enhanced prototype for few-shot action recognition

Published: 2025, Last Modified: 04 Nov 2025Neurocomputing 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•Propose modules to enhance spatio-temporal modeling for few-shot action recognition tasks.•Temporal Enhancement Adaptation improves temporal feature extraction in videos.•Spatio-Temporal Fusion Adaptation integrates spatial and temporal features effectively.•Text-Enhanced Prototype Module fuses textual and visual data for better prototype quality.•Achieves competitive results on benchmarks with minimal trainable parameters.
Loading