Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes

Published: 2023, Last Modified: 17 Nov 2025BMVC 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Abstract
Loading