Pseudo-labeling with keyword refining for few-supervised video captioning

Ping Li, Tao Wang, Xinkui Zhao, Xianghua Xu, Mingli Song

Published: 2025, Last Modified: 06 Feb 2025. Pattern Recognit. 2025. License: CC BY-SA 4.0

Abstract: Highlights
• A new task, few-supervised video captioning, which uses only one human-annotated sentence, is introduced.
• A pseudo-labeling strategy with lexical constraints is proposed to augment knowledge.
• A keyword-refined captioning module with video-text gated fusion is designed to generate high-quality sentences by modeling global context.
• Empirical studies demonstrate the satisfactory quality of the captions generated by the proposed method.