Automatic Transcription for Estonian Children's Speech

Published: 01 Jan 2023, Last Modified: 26 Sept 2024 | NoDaLiDa 2023 | CC BY-SA 4.0
Abstract: We evaluate the impact of recent improvements in Automatic Speech Recognition (ASR) on transcribing Estonian children’s speech. Our research focuses on fine-tuning large ASR models with a 10-hour Estonian children’s speech dataset to create accurate transcriptions. Our results show that large pre-trained models hold great potential when fine-tuned first with a more substantial Estonian adult speech corpus and then further trained with children’s speech.