Unbiased VQA via modal information interaction and question transformation

Published: 01 Jan 2025, Last Modified: 01 Mar 2025, Pattern Recognit. 2025, CC BY-SA 4.0
Abstract: Highlights
• We propose an unbiased VQA method, IEQM, to solve the language prior problem.
• We design two modules, ETV and INV, to generate two high-level visual features.
• We design a question converter that converts the original question into three sub-questions.
• We design a joint training method to train the IAE and QC modules together.