Simple contrastive learning in a self-supervised manner for robust visual question answering

Published: 01 Jan 2024, Last Modified: 13 Nov 2024, Comput. Vis. Image Underst. 2024, CC BY-SA 4.0
Abstract: Highlights
• SCLSM mitigates language bias in VQA through self-supervised learning.
• DCL sharpens the model's focus on hard negatives by modifying the contrastive loss.
• Our model achieves comparable results on VQA debiasing tasks.