Leveraging Large Vision-Language Model as User Intent-Aware Encoder for Composed Image Retrieval

Published: 01 Jan 2025, Last Modified: 15 May 2025AAAI 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading