Abstract: Highlights
• The GG-SMM is proposed to represent short-term temporal motion cues.
• The SG-LMM is used to activate long-term temporal and channel motion features.
• The SMAM is designed to focus on spatially motion-sensitive regions.
• This is the first attempt to apply an attention mechanism to all dimensions of video.