Universal Low Bit-Rate Speech Steganalysis Integrating Domain-Specific and Domain-Shared Knowledge

Published: 2025, Last Modified: 05 Apr 2026IEEE Trans. Dependable Secur. Comput. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Universal low bit-rate speech steganalysis is a cutting-edge research task addressing real-world application needs and has garnered significant attention recently. However, the existing methods are still inadequate in extracting available information from various steganographic domains and fail to deliver interpretable forensic results for specific steganographic domains containing embedded information. In view of this, we present a novel universal low bit-rate speech steganalysis approach that seamlessly combines domain-specific and domain-shared information, enabling comprehensive and effective speech steganography detection. This approach comprises two vital components: the Matching Identification Network (MIN) and the Content Alignment Network (CAN). The MIN incorporates three effective separable backbones for capturing informative domain-specific embeddings, inherently unveiling local and global dependencies. In this network, we also design a cross-domain matching module to establish correlations among steganographic domains, thereby enhancing detection performance through multi-domain collaboration and facilitating effective forensics for the embedded domains. Moreover, the CAN acquires more informative domain-shared embeddings by using a metric learning-based Siamese architecture to process pairs of naive and recompressed speech samples. Experimental results demonstrate that the presented method not only significantly surpasses the existing universal steganalysis methods, but also competes with or even surpasses dedicated steganalysis methods in certain cases. In addition, our method can provide accurate forensic results regarding the existence of hidden information within each steganographic domain without relevant supervisory information, marking a significant milestone in pursuit of speech steganalysis.
Loading