CREAM: Coarse-to-Fine Retrieval and Multi-modal Efficient Tuning for Document VQA

Jinxu Zhang, Yongqi Yu, Yu Zhang

Published: 28 Oct 2024, Last Modified: 03 Jan 2026CrossrefEveryoneRevisionsCC BY-SA 4.0
Loading