WaveSense: Efficient Temporal Convolutions with Spiking Neural Networks for Keyword Spotting

Published: 28 Jan 2022, Last Modified: 13 Feb 2023
Venue: ICLR 2022 Submitted
Readers: Everyone
Keywords: spiking, keyword spotting, temporal processing, streaming, audio, neuromorphic, wavenet, wavesense, always-on, low-power, temporal convolution
Abstract: Ultra-low-power local signal processing is crucial for edge applications on always-on devices. Neuromorphic processors emulating spiking neural networks offer substantial computational power while meeting the limited power budget this domain requires. In this work we propose spiking neural dynamics as a natural alternative to dilated temporal convolutions. We extend this idea to WaveSense, a spiking neural network inspired by the WaveNet architecture. WaveSense uses simple neural dynamics, fixed time constants, and a simple feed-forward architecture, and is hence particularly well suited to a neuromorphic implementation. We test the capabilities of this model on several keyword-spotting datasets. The results show that the proposed network outperforms the state of the art among other spiking neural networks and comes close to the state-of-the-art performance of artificial neural networks such as CNNs and LSTMs.
One-sentence Summary: The paper proposes efficient temporal convolutions in spiking neural networks and reports SOTA accuracy on keyword spotting tasks.
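Illustrative sketch (not from the paper): the abstract's claim that spiking neural dynamics can stand in for temporal convolutions rests on a general fact, namely that a leaky integrator with a fixed time constant computes a causal convolution of its input with an exponentially decaying kernel. The Python below demonstrates only that equivalence. The function names, the time constant `tau`, and the example spike train are assumptions made for illustration; this is not the WaveSense architecture or the authors' implementation.

```python
import math


def leaky_integrate(inputs, tau, dt=1.0):
    """Recursive leaky integrator: s[t] = alpha * s[t-1] + x[t].

    `tau` is a hypothetical fixed time constant; alpha = exp(-dt / tau).
    """
    alpha = math.exp(-dt / tau)
    state = 0.0
    outputs = []
    for x in inputs:
        state = alpha * state + x
        outputs.append(state)
    return outputs


def causal_exp_convolution(inputs, tau, dt=1.0):
    """Explicit causal convolution with the kernel k[n] = alpha ** n."""
    alpha = math.exp(-dt / tau)
    outputs = []
    for t in range(len(inputs)):
        # Sum over the current sample and every past sample, weighted by decay.
        outputs.append(sum(alpha ** k * inputs[t - k] for k in range(t + 1)))
    return outputs


# Hypothetical binary spike train (1.0 = spike, 0.0 = silence).
spikes = [1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0]

recursive = leaky_integrate(spikes, tau=4.0)
explicit = causal_exp_convolution(spikes, tau=4.0)

# The two computations agree up to floating-point rounding.
print(all(math.isclose(a, b, abs_tol=1e-9) for a, b in zip(recursive, explicit)))
```

The recursive form keeps a single state value per unit instead of a buffer of past samples, which is one reason such dynamics are attractive for streaming, always-on processing. A larger `tau` gives a longer temporal memory, which is only loosely analogous to widening the receptive field of a dilated convolution.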