Deep Probabilistic Video Compression

Jun Han; Salvator Lombardo; Christopher Schroers; Stephan Mandt

Deep Probabilistic Video Compression

Jun Han, Salvator Lombardo, Christopher Schroers, Stephan Mandt

27 Sept 2018 (modified: 05 May 2023)ICLR 2019 Conference Blind SubmissionReaders: Everyone

Abstract: We propose a variational inference approach to deep probabilistic video compression. Our model uses advances in variational autoencoders (VAEs) for sequential data and combines it with recent work on neural image compression. The approach jointly learns to transform the original video into a lower-dimensional representation as well as to entropy code this representation according to a temporally-conditioned probabilistic model. We split the latent space into local (per frame) and global (per segment) variables, and show that training the VAE to utilize both representations leads to an improved rate-distortion performance. Evaluation on small videos from public data sets with varying complexity and diversity show that our model yields competitive results when trained on generic video content. Extreme compression performance is achieved for videos with specialized content if the model is trained on similar videos.

Keywords: variational inference, video compression, deep generative models

TL;DR: Deep Probabilistic Video Compression Via Sequential Variational Autoencoders

9 Replies

Loading