Detecting Annotation Scheme Variation in Out-of-Domain Treebanks

LREC 2016 (modified: 08 Nov 2021)
Abstract: To ensure portability of NLP systems across multiple domains, existing treebanks are often extended by adding trees from interesting domains that were not part of the initial annotation effort. In this paper, we will argue that it is both useful from an application viewpoint and enlightening from a linguistic viewpoint to detect and reduce divergence in annotation schemes between extant and new parts in a set of treebanks that is to be used in evaluation experiments. The results of our correction and harmonization efforts will be made available to the public as a test suite for the evaluation of constituent parsing.