Keywords: multiword expressions, PARSEME, linguistics, annotation
Working Group: WG1: Corpus annotation
WG1 Tasks: Task 1.2 on MWE annotation guidelines and UD-PARSEME unification
Abstract: PARSEME is an initiative bringing together researchers of multiword expressions (MWEs) - a community that develops unified guidelines enabling consistent annotation of text corpora across many languages. One of the key features of the PARSEME model developed within PARSEME is tractability, which notably means that the corpus must be easily usable in natural language processing. Our goal here, however, is to move beyond the perspective adopted so far and to put ourselves in the shoes of a linguist who uses computational resources, but does not necessarily create them: we establish a common ground by presenting a linguistic background of PARSEME and principles of the tests. We also verify whether a similar set of tests can be applied to both content and functional words. At the end, we talk about several linguistic studies in which the PARSEME corpus has been used.
Tracks For Type Of Contribution: Complete work (including previously published work)
Do You Need Visa To Attend The 4th UniDive General Meeting In Romania: No
Email Sharing: We authorize the sharing of all author emails with Program Chairs.
Data Release: We authorize the release of our submission and author names to the public in the event of acceptance.
Submission Number: 61
Loading