Published: 01 Jan 2023, Last Modified: 02 Sept 2023ICML 2023Readers: Everyone
Abstract:Real-world data contains a vast amount of multimodal information, among which vision and language are the two most representative modalities. Moreover, increasingly heavier models, e.g., Transforme...