Representation learning strategies to model pathological speech: Effect of multiple spectral resolutions

Published: 01 Jan 2024, Last Modified: 20 May 2025Comput. Speech Lang. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•A novel auto-encoder fusion strategy is proposed to assess Parkinson’s dysarthria.•Strategy leverages varied spectral resolutions of speech as basis for a feature set.•Framework fuses bottleneck features and reconstruction error of varied spectra.•Proposed approach classifies Parkinson’s with 97% accuracy using cross-validation.•Output highly correlates with clinically assessed dysarthria ratings (ρ=0.79).
Loading