Development of Assamese Speech Corpus and Automatic Transcription Using HTK

Himangshu Sarma; Navanath Saharia; Utpal Sharma

Development of Assamese Speech Corpus and Automatic Transcription Using HTK

Himangshu Sarma, Navanath Saharia, Utpal Sharma

Published: 01 Jan 2014, Last Modified: 09 Feb 2025SIRS 2014EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Exact pronunciation of words of a language is not found from the written form of the language. Phonetic transcription is a step towards the speech processing of a language. For a language like Assamese it is most important because it is spoken differently in different regions of the state. In this paper we report automatic transcription of Assamese speech using Hidden Markov Model Tool Kit (HTK). We obtain accuracy of 65.26 an experiment. We transcribed recorded speech files using IPA symbols and ASCII for automatic transcription. We used 34 phones for IPA transcription and 38 for ASCII transcription.

Loading