A Context-Enhanced Generate-then-Evaluate Framework for Chinese Abbreviation Prediction

Hanwen Tong, Chenhao Xie, Jiaqing Liang, Qianyu He, Zhiang Yue, Jingping Liu, Yanghua Xiao, Wenguang Wang

2022 (modified: 27 Dec 2022)CIKM 2022Readers: Everyone

Abstract: As a popular form of lexicalization, abbreviation is widely used in both oral and written language and plays an important role in various Natural Language Processing applications. However, current approaches cannot ensure that the predicted abbreviation preserves the meaning of its full form and maintains fluency. In this paper, we introduce a fresh perspective to evaluate the quality of abbreviations within their textual contexts with pre-trained language model. To this end, we propose a novel two-stage generate-then-evaluate framework enhanced by context, which consists of a generation model to generate multiple candidate abbreviations and an evaluation model to evaluate their quality within their contexts. Experimental results show that our framework consistently outperforms all the existing approaches, achieving 53.2% [email protected] performance with a 5.6 points improvement compared to its previous best result. Our code and data are publicly available at https://github.com/HavenTong/CEGE.

0 Replies