TEAM MIPAL at MEDIQA-M3G 2024: Large VQA Models for Dermatological Diagnosis

Hyeonjin Kim, Min Kim, Jae Jang, KiYoon Yoo, Nojun Kwak

Published: 2024, Last Modified: 15 May 2025ClinicalNLP@NAACL 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: This paper describes the methods used for the NAACL 2024 workshop MEDIQA-M3G shared task for generating medical answers from image and query data for skin diseases. MedVInT-Decoder, LLaVA, and LLaVA-Med are chosen as base models. Finetuned with the task dataset on the dermatological domain, MedVInT-Decoder achieved a BLEU score of 3.82 during competition, while LLaVA and LLaVA-Med reached 6.98 and 4.62 afterward, respectively.