Tigrinya Dialect IdentificationDownload PDF

Published: 03 Mar 2023, Last Modified: 16 Apr 2023AfricaNLP 2023Readers: Everyone
Keywords: Tigrinya, dialect identification, NLP
TL;DR: This paper describes dialect identification of Tigrinya from text using machine learning.
Abstract: Dialect Identification is an important topic of research in Natural Language Processing (NLP) as it has broad implications in many real-world applications such as machine translation, speech recognition and chatbots to name a few. In this work, we investigate Tigrinya dialect identification using machine learning techniques. To that end, we have identified three Tigrinya dialects, namely: Z, L and D. Then we systematically collected datasets for each dialect. Finally, we perform experiments using classical machine learning and deep learning methods to quantify effectiveness of current methods on the problem of Tigrinya dialect identification. The highest overall accuracy of 92.98\% was achieved using character-level Convolutional Neural Networks (CNNs).
0 Replies

Loading