Ouroboros: Wafer-Scale SRAM CIM with Token-Grained Pipelining for Large Language Model Inference

Yiqi Liu, Yudong Pan, Mengdi Wang, Shixin Zhao, Haonan Zhu, Yinhe Han, Lei Zhang, Ying Wang

Published: 22 Mar 2026, Last Modified: 12 Mar 2026CrossrefEveryoneRevisionsCC BY-SA 4.0
Loading