Enhancing robust VQA via contrastive and self-supervised learning

Published: 01 Jan 2025, Last Modified: 11 Apr 2025Pattern Recognit. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•CSL-VQA combines self-supervised and contrastive learning for unbiased VQA training.•BSD distinguishes biased and unbiased samples, boosting generalization in VQA.•PNSG module balances dataset bias without manual annotation, enhancing VQA.•Achieves state-of-the-art 62.30% on VQA-CP v2 while preserving VQA v2 accuracy.
Loading