Query Generation Using GPT-3 for CLIP-Based Word Sense Disambiguation for Image Retrieval

Published: 01 Jan 2023, Last Modified: 18 Apr 2024*SEM@ACL 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: In this study, we propose using the GPT-3 as a query generator for the backend of CLIP as an implicit word sense disambiguation (WSD) component for the SemEval 2023 shared task Visual Word Sense Disambiguation (VWSD). We confirmed previous findings — human-like prompts adapted for WSD with quotes benefit both CLIP and GPT-3, whereas plain phrases or poorly templated prompts give the worst results.
Loading