Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion.

Ishaan Singh Rawal, Alexander Matyasko, Shantanu Jaiswal, Basura Fernando, Cheston Tan

13 Nov 2024 (modified: 15 May 2025)ICML 2024EveryoneRevisionsCC BY-SA 4.0
Loading