Oscar: Omni-scale robust contrastive learning for Text-VQA

Published: 01 Jan 2024, Last Modified: 09 Apr 2025Expert Syst. Appl. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•A novel Omni-scale robust contrastive learning framework achieves SOTA.•A novel perception comprehension module extracts comprehensive image information.•Two novel contrastive learning approaches are proposed.•An answer generation module facilitates accurate prediction process.
Loading