Abstract: Highlights
• We design the CT module, which substantially improves accuracy on DVS-Gesture.
• We integrate spatial-temporal attention into SNNs and demonstrate its advantages.
• We propose a directly trained Transformer-based SNN, termed "Spikeformer".
• We demonstrate the robustness and superiority of Spikeformer.