Spatial-temporal multi-scale interaction for few-shot video summarization

Qun Li, Zhuxi Zhan, Yanchao Li, Bir Bhanu

Published: 2025, Last Modified: 12 May 2025Eng. Appl. Artif. Intell. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Highlights•We propose a new task, called few-shot video summarization.•We propose a baseline for few-shot video summarization, namely STeMI.•STeMI endeavors to extract spatial-temporal representations by using a dual-branch.•We adapt several SOTAs for video summarization to few-shot scenarios.•We conduct comparative evaluations against our proposed STeMI model.