Efficiently Gluing Pre-Trained Language and Vision Models for Image Captioning

Published: 2024; Last Modified: 27 Jan 2026; ACM Trans. Intell. Syst. Technol. 2024; CC BY-SA 4.0