ANT: Adapt Network Across Time for Efficient Video Processing

CVPR Workshops 2022 (modified: 17 Nov 2022)
Abstract: Abundant redundancies exist in video streams, pointing to opportunities to save computation. Towards this end, we propose the Adaptive Network across Time (ANT) framework to harness these redundancies and reduce the computational cost of video processing. Unlike most dynamic networks that adapt their structures to different static inputs, our method adapts networks along the temporal dimension. By inspecting the semantic differences between frames, the proposed ANT chooses a purpose-fit network at test time to reduce overall computation, i.e., switching to a smaller network when observing mild differences. ANT adapts structured networks within a supernet, making it hardware-friendly and therefore achieving actual acceleration in real-world scenarios. ANT is powered by (1) a fusion module that utilizes past features and (2) a dynamic gate that adjusts the network in a predictive fashion with negligible extra cost. To ensure the generality of each subnet and the fairness of the gate, we propose a two-stage training scheme: we first train a weight-sharing supernet and then jointly train the fusion modules and gates. Evaluation on the video detection task with the modern EfficientDet demonstrates the effectiveness of our approach.
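The following is a minimal sketch (not the authors' code) of the temporal gating idea described in the abstract: a lightweight gate inspects the difference between the current and past frame features and picks a subnet inside a weight-sharing supernet, so that mildly changed frames are routed to a smaller network. The module names, subnet count, and the difference summary used here are illustrative assumptions, not the paper's actual design.

```python
# Illustrative sketch of a predictive temporal gate; all design choices here
# (MLP gate, mean absolute feature difference, 3 subnets) are assumptions.
import torch
import torch.nn as nn


class TemporalGate(nn.Module):
    """Predicts which subnet to run from the frame-to-frame feature change."""

    def __init__(self, feat_dim: int, num_subnets: int = 3):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(feat_dim, 64), nn.ReLU(), nn.Linear(64, num_subnets)
        )

    def forward(self, cur_feat: torch.Tensor, past_feat: torch.Tensor) -> int:
        # Summarize per-channel change between frames; small changes should
        # steer the gate toward a smaller subnet index.
        diff = (cur_feat - past_feat).abs().mean(dim=(2, 3))  # (B, C)
        logits = self.mlp(diff)
        return int(logits.argmax(dim=1)[0])  # index of the chosen subnet


if __name__ == "__main__":
    gate = TemporalGate(feat_dim=256)
    cur = torch.randn(1, 256, 16, 16)
    past = cur + 0.01 * torch.randn_like(cur)  # nearly identical frame
    print("chosen subnet:", gate(cur, past))
```

In the paper's framework this decision would be made with negligible extra cost and combined with a fusion module that reuses past features; the sketch above only shows the gating step.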