Facilitating Treebank Annotation Using a Statistical Parser

Fu-Dong Chiou, David Chiang, Martha Palmer

2001 (modified: 16 Jul 2019), HLT 2001

Abstract: Corpora of phrase-structure-annotated text, or treebanks, are useful for supervised training of statistical models for natural language processing, as well as for corpus linguistics. Their primary drawback, however, is that they are very time-consuming to produce. To alleviate this problem, the standard approach is to make two passes over the text: first, parse the text automatically, then correct the parser output by hand.
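The two-pass workflow described in the abstract can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the paper: `toy_parser` is a trivial stand-in for a statistical parser, trees are plain bracketed strings, and the annotator's hand corrections are modeled as a dictionary from sentence index to corrected tree.

```python
from typing import Callable, Dict, List

Tree = str  # a bracketed phrase-structure tree, e.g. "(S (NP dogs) (VP bark))"


def toy_parser(sentence: str) -> Tree:
    """Hypothetical stand-in for a statistical parser.

    It produces a flat bracketing with a placeholder label X for every word;
    a real parser would propose a full phrase-structure analysis.
    """
    words = sentence.split()
    return "(S " + " ".join(f"(X {w})" for w in words) + ")"


def annotate(
    sentences: List[str],
    parse: Callable[[str], Tree],
    corrections: Dict[int, Tree],
) -> List[Tree]:
    """Two-pass annotation: parse automatically, then apply hand corrections."""
    treebank: List[Tree] = []
    for i, sentence in enumerate(sentences):
        draft = parse(sentence)              # pass 1: automatic parse
        final = corrections.get(i, draft)    # pass 2: annotator's fix, if any
        treebank.append(final)
    return treebank


sentences = ["dogs bark", "birds sing"]
# The annotator accepts the first draft and corrects the second one by hand.
corrections = {1: "(S (NP birds) (VP sing))"}
treebank = annotate(sentences, toy_parser, corrections)
```

In this sketch the annotator only supplies trees for sentences whose draft parse is wrong, which is where the time saving of the two-pass approach is expected to come from.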
