Layerweaver+: A QoS-Aware Layer-Wise DNN Scheduler for Multi-Tenant Neural Processing Units

2022 (modified: 15 May 2022), IEICE Trans. Inf. Syst. 2022, Readers: Everyone
Abstract: Many cloud service providers employ specialized hardware accelerators, called neural processing units (NPUs), to accelerate deep neural networks (DNNs …