Abstract: Highlights•A multi-modal few-shot image recognition approach with superior performance.•Introduced a novel multi-scale interaction module for semantic–visual features.•Proposed a similarity measurement module combining diverse methods for FSL.•Achieved superior performance on four benchmarks in 1-shot and 5-shot settings.
External IDs:doi:10.1016/j.imavis.2025.105490
Loading