Explainable Knowledge reasoning via thought chains for knowledge-based visual question answering

Published: 01 Jan 2024, Last Modified: 19 May 2025Inf. Process. Manag. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We investigate the role of multimodal CoT distilled knowledge from LLMs in KBVQA tasks.•We present a Multimodal Knowledge Reasoning via CoT (MuKCoT) model for reasoning and explanation.•Performance of MuKCoT fosters the higher-quality explanations for the KBVQA tasks.
Loading