OSAMamba: An Adaptive Bidirectional Selective State-Space Model for OSA Detection

Chengjian Li, Zhenghao Shi, Na Li, Zhenzhen You, Yitong Zhang, Xiaoyong Ren, Xinhong Hei, Haiqin Liu

Published: 01 Jan 2025, Last Modified: 25 Sept 2025IEEE Trans. Instrum. Meas. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: As the two most typical classic network models, convolutional neural networks (CNNs) and Transformer have been widely applied in obstructive sleep apnea (OSA) detection in recent years. However, due to the inherent limitations of the receptive field in traditional CNN models (the receptive field is positively correlated with the fixed convolutional kernel size, and the ability to extract global feature information is limited), further improvement in their performance is constrained. While, for the Transformer, due to the computational complexity of the self-attention mechanism in the Transformer model increases exponentially with the length of the context, it will hold a very high computational overhead, and which would hinder the deployment of the Transformer on devices with limited computing resources. To address these problems, this article proposes an adaptive bidirectional selective state-space model (ABSM)-based method for OSA detection, termed as OSAMamba. The main novelty of the proposed method lies in the following two aspects: the development of the lightweight multiscale efficient aggregation (LMSEA) module and the propose of ABSM. To achieve the purpose of expanding the model receptive field and capturing the effective temporal features with a very low number of parameters, the LMSEA module adopts a combination of partial convolution (PConv)-based multiscale strategy and convolutional block attention module (CBAM). The purpose of the ABSM module is to reduce the computational cost of the model and improve the model deployability by using a frequency-domain enhancement strategy to fuse the effective time-domain features extracted by adaptive bidirectional Mamba (ABi-Mamba) with linear complexity with the frequency-domain features extracted by the frequency-domain enhancement module (FEM). Extensive experiments on the Apnea-electrocardiogram (ECG) dataset show that of all compared methods, the proposed method obtains the best accuracy of 91.91% in the per-segment detection, and which surpasses the state-of-the-art (SOTA) TFFormer by 0.31%. It also achieves a remarkable accuracy of 100% with the lowest mean absolute error (MAE) of 2.43 in per-record detection.