EDUSTT: In-Domain Speech Recognition for Nigerian Accented Educational Contents in English

Sharon Ibejih; Wuraola Fisayo Oyewusi; Olubayo Adekanmbi; Opeyemi Osakuade

Published: 08 Apr 2022 | Last Modified: 05 May 2023 | AfricaNLP 2022 | Readers: Everyone

Keywords: ASR, NeMo, Nigerian accent, Domain Specific ASR

TL;DR: Accent-tuned automatic speech recognition for Nigerian educational content

Abstract: English Automatic Speech Recognition (ASR) systems are typically trained on general speech, so they may struggle with accented and domain-specific speech. For broader applications of ASR, such as synchronous learning in education, it is important to have a reliable, specialized system that recognises the terms used in school subjects as spoken by accented teachers. English is the official language in Nigeria, and it is the main language of instruction in schools. However, our teachers come from different parts of the country, and their mother tongues affect how they pronounce certain words. This paper proposes an ASR system for Nigerian-accented educational content. Our experiment fine-tuned NeMo's QuartzNet15x5 English model on our accented educational data, yielding a word error rate (WER) of 27%.
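
For readers unfamiliar with the metric reported above, WER is conventionally the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the ASR output, divided by the number of reference words. The abstract does not say how the authors computed or normalised it, so the following is only a minimal sketch of the standard definition; the function name `word_error_rate`, the lowercasing step, and the example sentences are assumptions made for illustration, not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative only: lowercasing and whitespace tokenisation are
    assumptions, not the paper's documented preprocessing.
    """
    ref_words = reference.lower().split()
    hyp_words = hypothesis.lower().split()
    if not ref_words:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]  # deleting all i reference words to match an empty hypothesis
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,                      # deletion
                    curr[j - 1] + 1,                  # insertion
                    prev[j - 1] + substitution_cost,  # match or substitution
                )
            )
        prev = curr
    return prev[-1] / len(ref_words)


# Hypothetical classroom sentence: one substituted word out of eight.
wer = word_error_rate(
    "the cell is the basic unit of life",
    "the sell is the basic unit of life",
)
print(f"WER = {wer:.3f}")
```

Under this definition, a WER of 27% corresponds to roughly 27 word-level errors per 100 reference words.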
