PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition

Published: 01 Jan 2025, Last Modified: 15 Jun 2025Comput. Speech Lang. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•PaSCoNT is a parallel speech corpus of Northern-Central Thai recorded 100 h of speech.•PaSCoNT consists of 907,832 words and 6279 vocabulary items.•There were statistically significant differences speech tempo between Central Thai and Northern Thai.•ASR model using the PaSCoNT can be used for both Northern Thai and Central Thai dialect speech recognition.
Loading