Extracting coordinate word pairs for dependency parsingDownload PDFOpen Website

2015 (modified: 17 Nov 2021)IALP 2015Readers: Everyone
Abstract: The subtask of identifying coordinate structures in Chinese dependency analysis is a challenging problem. The accuracy of coordinate word recognition remains below the average. To address this problem, we propose an automatic identification method based on large-scale unlabeled corpus. We then integrate a set of new features corresponding to the collected word pairs into the dependency parser. Specifically, our proposed method is based on the presence of easy-to-identify coordinate fragments. Our method can be divided into two steps. In the first step, we leverage two hand-crafted rules to extract highly accurate coordinate word pairs as seed words. The second step is to utilize seed words to extract coordinate structures in the corpus for further use of coordinate word pair extraction. Experimental results show that the extracted coordinate word pairs can significantly improve the accuracy on coordinate structure dependency analysis.
0 Replies

Loading