Is Arabic Part of Speech Tagging Feasible Without Word Segmentation?Download PDFOpen Website

2010 (modified: 12 Nov 2022)HLT-NAACL 2010Readers: Everyone
Abstract: In this paper, we compare two novel methods for part of speech tagging of Arabic without the use of gold standard word segmentation but with the full POS tagset of the Penn Arabic Treebank. The first approach uses complex tags without any word segmentation, the second approach is segmention-based, using a machine learning segmenter. Surprisingly, word-based POS tagging yields the best results, with a word accuracy of 94.74%.
0 Replies

Loading