Efficient and Robust Semantic Image Communication via Stable Cascade

Published: 06 Jun 2025, Last Modified: 30 Jun 2025ICML Workshop on ML4WirelessEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Semantic Communication, Image Transmission, Generative AI, Diffusion Model
Abstract: Diffusion Model (DM) based Semantic Image Communication (SIC) systems face significant challenges, such as slow inference speed and generation randomness, that limit their reliability and practicality. To overcome these issues, we propose a novel SIC framework inspired by Stable Cascade, where extremely compact latent image embeddings are used as conditioning to the diffusion process. Our approach drastically reduces the data transmission overhead, compressing the transmitted embedding to just $0.29$\% of the original image size. It outperforms three benchmark approaches - the diffusion SIC model conditioned on segmentation maps (GESCO), the recent Stable Diffusion (SD)-based SIC framework (Img2Img-SC), and the conventional JPEG2000 $+$ LDPC coding - by achieving superior reconstruction quality under noisy channel conditions, as validated across multiple metrics. Notably, it also delivers significant computational efficiency, enabling over $3\times$ faster reconstruction for $512\times512$ images and more than $16\times$ faster for $1024\times1024$ images as compared to the approach adopted in Img2Img-SC.
Submission Number: 13
Loading