Corpora with Part-of-Speech Annotations for Three Regional Languages of France: Alsatian, Occitan and Picard

2018 (modified: 12 Jan 2022), LREC 2018
Abstract: This article describes the creation of corpora with part-of-speech annotations for three regional languages of France: Alsatian, Occitan and Picard. These manual annotations were performed in the context of the RESTAURE project, whose goal is to develop resources and tools for these under-resourced French regional languages. The article presents the tagsets used in the annotation process as well as the resulting annotated corpora.