LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Published: 01 Jan 2024, Last Modified: 21 May 2025ICLR 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading