Diversity-aware Evaluation for Paraphrase Patterns

2011 (modified: 16 Jul 2019) | TextInfer@EMNLP 2011 | Readers: Everyone
Abstract: Common evaluation metrics for paraphrase patterns do not necessarily correlate with performance on the extrinsic recognition task. We propose a metric that gives weight to lexical variety in paraphrase patterns. The proposed metric correlates positively with paraphrase recognition task performance: the Pearson correlation is 0.5 to 0.7 (k=10, with "strict" judgment) and is statistically significant (p < 0.01).
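For readers unfamiliar with the statistic reported above, the following is a minimal sketch of how a Pearson correlation between an intrinsic metric score and an extrinsic task score could be computed. The function name `pearson_r` and the two input lists are hypothetical and chosen for illustration only; the numbers are toy values, not results from the paper, and the sketch does not reproduce the proposed metric itself. A significance test (the p-value) is not included here; a library routine such as `scipy.stats.pearsonr` returns both the coefficient and a p-value.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations (unnormalized covariance).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations (unnormalized variances).
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical toy values (NOT from the paper): one metric score and one
# recognition-task score per set of paraphrase patterns.
metric_scores = [0.10, 0.25, 0.40, 0.55, 0.70]
task_scores = [0.52, 0.50, 0.61, 0.60, 0.68]

r = pearson_r(metric_scores, task_scores)
print(f"Pearson r = {r:.3f}")
```

A value of r near 1 would indicate that pattern sets scored higher by the metric also tend to perform better on the recognition task, which is the kind of relationship the abstract reports.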