Published: 01 Jan 2023, Last Modified: 30 Oct 2023, ICML 2023, Readers: Everyone
Abstract: Recent years have witnessed a big convergence of language, vision, and multi-modal pretraining. In this work, we present mPLUG-2, a new unified paradigm with modularized design for multi-modal pret...