Abstract: Highlights
• Attention guiding is significant for video saliency prediction based on 3D CNN.
• A spatiotemporally attentive 3D CNN for robust video saliency prediction is proposed.
• An adaptive upsampling module for refining spatial features is proposed.
• A frame-wise attention module for propagating temporal features is proposed.
• The effectiveness of the proposed method is comprehensively evaluated.