hls4ml: An Open-Source Co-Design Workflow to Empower Scientific Low-Power Machine Learning Devices

Farah Fahim; Benjamin Hawks; Christian Herwig; James Hirschauer; Sergo Jindariani; Nhan Tran; Luca Carloni; Giuseppe Di Guglielmo; Philip Harris; Jeffrey Krupa; Dylan Rankin; Manuel Blanco Valentin; Josiah Hester; Yingyi Luo; John Mamish; Seda Memik; Thea Aarrestad; Hamza Javed; Vladimir Loncar; Maurizio Pierini; Adrian Alan Pol; Sioni Summers; Javier Duarte; Scott Hauck; Shih-Chieh Hsu; Jennifer Ngadiuba; Mia Liu; Duc Hoang; Edward Kreinar; Zhenbin Wu

hls4ml: An Open-Source Co-Design Workflow to Empower Scientific Low-Power Machine Learning Devices

Published: 07 Feb 2021, Last Modified: 05 May 2023tinyML 2021 RegularReaders: Everyone

Keywords: hls4ml, machine learning, neural networks, tinyML, FPGA, ASIC, low-power, low-latency

TL;DR: We present hls4ml, an open-source software-hardware co-design workflow to translate machine learning algorithms for implementation in FPGAs and ASICs to support science.

Abstract: Accessible machine learning algorithms, software, and diagnostic tools for energy-efficient devices and systems are extremely valuable across a broad range of application domains. In scientific domains, real-time near-sensor processing can drastically improve experimental design and accelerate scientific discoveries. We have developed hls4ml, an open-source software-hardware co-design workflow to interpret and translate machine learning algorithms for implementation in FPGAs and ASICs specifically to support domain scientists. In this paper, we describe the essential features of the hls4ml workflow including network optimization techniques---such as pruning and quantization-aware training---which can be incorporated naturally into the device implementations. We expand on previous hls4ml work by extending capabilities and techniques towards low-power implementations and increased usability: new Python APIs, quantization-aware pruning, end-to-end FPGA workflows, long pipeline kernels for low power, and new device backends include an ASIC workflow. Taken together, these and continued efforts in hls4ml will arm a new generation of domain scientists with accessible, efficient, and powerful tools for machine-learning-accelerated discovery.

5 Replies

Loading