MMBypass: Towards efficient multi-modal AI computing with adaptive bypass network

Yifei Pu, Xinfeng Xia, Xiaofeng Hou, Chi Wang, Cheng Xu, Jiacheng Liu, Jing Wang, Minyi Guo, Jingling Yuan, Chao Li

Published: 2025, Last Modified: 25 Jan 2026J. Parallel Distributed Comput. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Highlights•Propose a new neural network architecture for efficient multi-modal ai computing.•Design two lightweight modules for low execution latency while keeping accuracy.•Evaluate on real-world multi-modal tasks and reduce up to 44.5% execution latency.•Apply parallel computing to multi-modal models and compare the performance.