Building a Contact Language Treebank with UniDive Support: The Asia Minor Greek in Contact (AMGiC) UD Resource
Keywords: Universal Dependencies, treebank, language contact, Asia Minor Greek, Cappadocian, contact-induced morphosyntactic phenomena, endangered languages
Working Group: WG1: Corpus annotation, WG4: Quantifying and promoting diversity
WG1 Tasks: Task 1.1: Linguistic typology and multilingual corpus annotation, Task 1.4: Sharing tools, formats, and infrastructure
WG4 Tasks: Task 4.1: Promoting low-resourced/endangered languages
Tracks For Type Of Contribution: Work in progress
Abstract: The UniDive COST Action has provided a framework for advancing Universal Dependencies resources for typologically diverse and under-documented languages. We report on how UniDive activities—Short-Term Scientific Missions, working group discussions, and training schools—have directly contributed to the development of the Asia Minor Greek in Contact (AMGiC) treebank, a UD resource for annotating contact-induced morphosyntactic phenomena in Asia Minor Greek varieties. The treebank currently contains 72 annotated sentences (851 tokens) covering Silliot and Cappadocian (Delmesó subdialect) Greek, with a systematic taxonomy of contact phenomena encoded in the CoNLL-U MISC field and sociodemographic metadata enabling quantitative sociolinguistic analysis.
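As an illustration of how annotations stored in the CoNLL-U MISC field can feed quantitative analysis, the following is a minimal sketch in Python. It assumes that contact phenomena are recorded as `Key=Value` pairs in the tenth column. The key name `Contact`, the values `ExampleA` and `ExampleB`, and the token forms are hypothetical placeholders, not the labels or data used in AMGiC.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment: the token forms and the "Contact" key
# are placeholders for illustration, not taken from the AMGiC treebank.
SAMPLE = """\
# sent_id = demo-1
1\tw1\tw1\tNOUN\t_\t_\t2\tnsubj\t_\tContact=ExampleA
2\tw2\tw2\tVERB\t_\t_\t0\troot\t_\tContact=ExampleB|SpaceAfter=No
3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""


def parse_misc(misc):
    """Split a CoNLL-U MISC value into a dict; '_' marks an empty field."""
    if misc == "_":
        return {}
    pairs = (item.partition("=") for item in misc.split("|"))
    return {key: value for key, _, value in pairs}


def count_misc_key(conllu_text, key):
    """Count the values of one MISC key over all token lines."""
    counts = Counter()
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:  # a CoNLL-U token line has exactly 10 columns
            continue
        misc = parse_misc(cols[9])
        if key in misc:
            counts[misc[key]] += 1
    return counts


print(count_misc_key(SAMPLE, "Contact"))
```

The same counting step could be grouped by sentence-level comment lines (for example a speaker or variety identifier) to relate phenomena to sociodemographic metadata, provided the treebank stores that metadata in such comments.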
Do You Need Visa To Attend The 4th UniDive General Meeting In Romania: Yes
Email Sharing: We authorize the sharing of all author emails with Program Chairs.
Data Release: We authorize the release of our submission and author names to the public in the event of acceptance.
Submission Number: 69