Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers

2021 (modified: 17 Nov 2021), EMNLP (1) 2021, Readers: Everyone
Stella Frank, Emanuele Bugliarello, Desmond Elliott. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 2021.