Developing Natural Language Processing Tools for Egyptian

Published: 27 May 2026, Last Modified: 27 May 2026UniDive 2026EveryoneRevisionsCC BY-SA 4.0
Keywords: Pre-Coptic Egyptian, treebank, multi-word expression, Pyramid Texts, Grew-match, parser.
Working Group: WG1: Corpus annotation, WG2: Lexicon-corpus interface, WG3: Multilingual and cross-lingual language technology, WG4: Quantifying and promoting diversity
Abstract: This paper describes the natural language processing tools for pre-Coptic Egyptian developed during the UniDive COST Action (2022–2026). These tools are the Universal Dependencies EPC treebank, the PARSEME corpus of Egyptian multi-word expressions, GrewPT, and a parser for pre-Coptic Egyptian sentences.
Tracks For Type Of Contribution: Work in progress
Do You Need Visa To Attend The 4th UniDive General Meeting In Romania: No
Email Sharing: We authorize the sharing of all author emails with Program Chairs.
Data Release: We authorize the release of our submission and author names to the public in the event of acceptance.
Submission Number: 2
Loading