Dual Precision Deep Neural Network

Published: 01 Jan 2020, Last Modified: 15 May 2023, CoRR 2020, Readers: Everyone
Abstract: On-line precision scalability of deep neural networks (DNNs) is a critical feature for supporting the trade-off between accuracy and complexity during DNN inference. In this paper, we propose a dual-precision DNN that includes two different precision modes in a single model, thereby supporting an on-line precision switch without re-training. The proposed two-phase training process optimizes both the low- and high-precision modes.
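
The idea of two precision modes sharing a single model can be illustrated with a small sketch. This is not the method of the paper: the abstract does not specify bit widths or how the low-precision weights are obtained, so the 8-bit/4-bit split, the truncation of least-significant bits, and the names `quantize` and `DualPrecisionLinear` are all assumptions made for illustration. The two-phase training process is not shown.

```python
# Hypothetical sketch: one stored weight set, two precision modes.
# The bit widths (8 and 4) and the bit-truncation scheme are assumptions,
# not details taken from the abstract.

def quantize(weights, scale, bits=8):
    """Round float weights to signed integers of the given bit width."""
    lo, hi = -(1 << (bits - 1)), (1 << (bits - 1)) - 1
    return [max(lo, min(hi, round(w / scale))) for w in weights]


class DualPrecisionLinear:
    """A single-output linear unit whose precision can be switched at inference."""

    def __init__(self, weights, scale, high_bits=8, low_bits=4):
        self.scale = scale
        self.shift = high_bits - low_bits   # number of bits dropped in low mode
        self.q = quantize(weights, scale, high_bits)  # stored once, shared by both modes
        self.mode = "high"

    def set_mode(self, mode):
        """Switch precision on-line; no weights are re-trained or re-stored."""
        if mode not in ("high", "low"):
            raise ValueError("mode must be 'high' or 'low'")
        self.mode = mode

    def effective_weights(self):
        if self.mode == "high":
            return list(self.q)
        # Low mode: keep only the most-significant bits of each stored weight.
        return [(q >> self.shift) << self.shift for q in self.q]

    def forward(self, x):
        acc = sum(w * xi for w, xi in zip(self.effective_weights(), x))
        return self.scale * acc


layer = DualPrecisionLinear([0.5, -0.25, 0.1], scale=1 / 128)
x = [1, 1, 1]
high_out = layer.forward(x)   # uses all 8 bits of each weight
layer.set_mode("low")
low_out = layer.forward(x)    # uses only the top 4 bits
```

In this sketch the low-precision mode costs nothing extra in storage, because it reuses the high-precision weights. The price is the truncation error visible in the difference between `high_out` and `low_out`, which is the kind of gap a training procedure that optimizes both modes would aim to reduce.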