Part-of-speech tagging for Arabic tweets using CRF and Bi-LSTM

Published: 01 Jan 2021, Last Modified: 20 Feb 2025, Comput. Speech Lang. 2021, CC BY-SA 4.0
Abstract: Highlights
• POS taggers are developed for the MSA and GLF variants of Arabic using CRF and BiLSTM.
• The gold-standard annotated datasets constructed for POS tagging are made accessible to the research community.
• An exploratory analysis of hashtag usage in Arabic tweets is presented, which can be leveraged in future studies.
• The BiLSTM-based POS tagger for Arabic tweets achieves the best performance.
• Experiments show that there is no need for a dialect-specific POS tagger.