Prediction of Depression Severity Based on Transformer Encoder and CNN ModelDownload PDFOpen Website

Published: 01 Jan 2022, Last Modified: 11 May 2023ISCSLP 2022Readers: Everyone
Abstract: Depression is one of the most common mental health disorders and can lead to suicide in extreme cases. It is crucial to develop and design an automatic depression detection(ADD) model that is objective, effective, and robust. Many works have been done to develop automatic depression detection systems using speech data due to its easy accessibility and privacy. Although most current methods use RNN, it is difficult to capture the long-term dependence of depressed audio due to the gradient explosion/vanish. In this paper, a novel model combining Transformer Encoder and CNN is proposed to address this problem. Experiments on the DAIC-WOZ dataset show that the performance of the algorithm has improved considerably, with a mean absolute error (MAE) of 4.48 and a root mean square error (RMSE) of 5.37.
0 Replies

Loading