TreeSNNs: Temporal Resolution Ensembled SNNs for Neuromorphic Action Recognition

18 Sept 2025 (modified: 11 Feb 2026)Submitted to ICLR 2026EveryoneRevisionsBibTeXCC BY 4.0
Keywords: Spiking Neural Networks, Temporal Ensembling, Neuromorphic Action Recognition
Abstract: Spiking Neural Networks (SNNs) are energy-efficient due to sparse and asynchronous event processing, but their accuracy often lags behind that of conventional ANN deep learning models. Prior research has largely focused on novel SNN architectures and training methods, with continuous event streams typically binned into frames using either fixed event counts or fixed time intervals. In this paper, we observe that different motions exhibit distinct temporal dynamics and may be best captured at different temporal/event resolutions. Building on this insight, we propose TreeSNNs, an ensemble framework where model diversity is expressed via multiple event temporal resolutions. We utilize the Fano factor as a metric to quantify temporal dynamics and guide the selection of such a diverse set of temporal resolutions tailored to a given dataset. Individual SNNs trained at these resolutions are then aggregated through ensembling to improve recognition accuracy. Experiments on three neuromorphic action datasets—DVS Gesture, SL-Animals DVS, and the challenging THU$^{E\text{-}ACT}$-50 CHL—show that TreeSNNs consistently outperform baselines, improving accuracy by 1.05\%–6.8\%.
Primary Area: applications to neuroscience & cognitive science
Submission Number: 13608
Loading