Abstract: Highlights•Alleviating the imperfection of events by utilizing the spatiotemporal attention-based module.•Fully exploiting the cross-modal characteristics between frames and events.•Designing long-term motion information for global temporal information extraction.•Multi-scale optical flow losses of the proposed objective function.
Loading