Enhanced Interpretability in Learned Image Compression via Convolutional Sparse Coding

Yiwen Tu, Wen Tan, Youneng Bao, Genhong Wang, Fanyang Meng, Yongsheng Liang

Published: 2024, Last Modified: 10 Nov 2025ICME 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Compared to traditional image compression methods, learned image compression (LIC) methods have demonstrated increasingly superior rate-distortion performance. However, LIC networks are often regarded as black boxes, still lacking a theoretical understanding. Sparse coding provides the sparse and interpretable modeling for analyzing or synthesizing natural images in various signal and image processing applications. Therefore, we introduce convolutional sparse coding (CSC) into transform network for enhancing the interpretability of LIC methods. In this paper, we first employ CSC layers to achieve certain theoretical modeling for LIC network, and adopt a weight sharing strategy in encoder-decoder pair and attention mechanism to balance the complexity and performance. Additionally, we analyze the model robustness against data input perturbations and consider the impact of sparsity trade-off parameter in the CSC layer optimization process. Experimental results demonstrate that our method achieves comparable performance with the corresponding baseline, and our model is more robust.