Improving DNN Inference Throughput Using Practical, Per-Input Compute Adaptation

Anand Padmanabha Iyer, Mingyu Guan, Yinwei Dai, Rui Pan, Swapnil Gandhi, Ravi Netravali

Published: 04 Nov 2024, Last Modified: 30 Jan 2026, Crossref, CC BY-SA 4.0