MSCCL++: Rethinking GPU Communication Abstractions for AI Inference

Changho Hwang, Peng Cheng, Roshan Dathathri, Abhinav Jangda, Saeed Maleki, Madan Musuvathi, Olli Saarikivi, Aashaka Shah, Ziyue Yang, Binyang Li, Caio Rocha, Qinghua Zhou, Mahdieh Ghazimirsaeed, Sreevatsa Anantharamu, Jithin Jose

Published: 2026, Last Modified: 07 May 2026ASPLOS (2) 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading