Toggle navigation
OpenReview
.net
Login
×
Go to
IEICETD 2022
homepage
Layerweaver+: A QoS-Aware Layer-Wise DNN Scheduler for Multi-Tenant Neural Processing Units
Young H. Oh
,
Yunho Jin
,
Tae Jun Ham
,
Jae W. Lee
2022 (modified: 15 May 2022)
IEICE Trans. Inf. Syst. 2022
Readers:
Everyone
Abstract:
Many cloud service providers employ specialized hardware accelerators, called neural processing units (NPUs), to accelerate deep neural networks (DNNs …
0 Replies
Loading