Abstract: Fine-tuning a pre-trained cross-modal model is an effective way to improve cross-modal retrieval performance. However, conventional fine-tuning typically requires substantial computational resources. To alleviate this requirement, we propose a parameter-efficient tuning method for pre-trained models via prompt learning for cross-modal retrieval. Inspired by prompt learning techniques in natural language processing, our method constructs a multidimensional vector as a prompt for cross-modal retrieval and optimizes only the prompt's small number of parameters to achieve better retrieval performance. Experiments on an open dataset verify that the proposed method is both effective and parameter-efficient.
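The abstract does not specify the backbone architecture or the exact prompt placement, but the general idea of parameter-efficient prompt tuning can be sketched as follows. In this toy illustration (all names and dimensions are assumptions, not the paper's implementation), a large frozen linear projection stands in for the pre-trained cross-modal encoder, and only a small prompt vector added to the text-side feature is optimized to pull the text embedding toward the matching image embedding.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 16

# Frozen "pre-trained" encoder: a fixed linear projection standing in for a
# large cross-modal backbone. W is never updated during tuning.
W = rng.standard_normal((d, d))

text_feat = rng.standard_normal(d)   # toy text-side feature
image_feat = rng.standard_normal(d)  # toy image-side feature (retrieval target)

# Learnable prompt: a small vector added to the text feature before encoding.
# These d parameters are the ONLY ones optimized.
prompt = np.zeros(d)

def loss(p):
    # Distance between the prompted text embedding and the image embedding.
    diff = W @ (text_feat + p) - W @ image_feat
    return float(diff @ diff)

lr = 1e-3
losses = [loss(prompt)]
for _ in range(200):
    # Analytic gradient w.r.t. the prompt only; the backbone W stays frozen.
    grad = 2 * W.T @ (W @ (text_feat + prompt) - W @ image_feat)
    prompt -= lr * grad
    losses.append(loss(prompt))

print(f"trainable params: {prompt.size} vs frozen params: {W.size}")
print(f"loss: {losses[0]:.3f} -> {losses[-1]:.3f}")
```

The point of the sketch is the parameter count: tuning touches `d` prompt parameters while the `d * d` backbone weights stay fixed, which is what makes the approach parameter-efficient relative to full fine-tuning.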