mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and VideoDownload PDFOpen Website

Published: 01 Jan 2023, Last Modified: 30 Oct 2023ICML 2023Readers: Everyone
Abstract: Recent years have witnessed a big convergence of language, vision, and multi-modal pretraining. In this work, we present mPLUG-2, a new unified paradigm with modularized design for multi-modal pret...
0 Replies

Loading