HiLINK: Hierarchical linking of context-aware knowledge prediction and prompt tuning for bilingual knowledge-based visual question answering

Published: 01 Jan 2025, Last Modified: 14 Sept 2025. Venue: Knowl. Based Syst. 2025. License: CC BY-SA 4.0
Abstract: Highlights
• Simplifies the two-stage training into an end-to-end structure for efficiency.
• Enables relationship learning via Bayesian network-based contextual awareness.
• Facilitates bilingual representation learning via a trainable encoder strategy.
• Exhibits superior training effectiveness in a bilingual setting compared with a monolingual one.
• HiLINK shows outstanding performance on BOK-VQA in all language settings.