Revealing Long-term Language Change with Subword-incorporated Word Embedding Models

2019 (modified: 05 Nov 2021), CogSci 2019, Readers: Everyone
Abstract: We propose an augmented word embedding model that better incorporates subword information, with additional parameters that characterize the semantic weights of characters in composing words. Our model reveals patterns of long-term change in the Chinese language, providing novel evidence and methodology that enrich existing theories in evolutionary linguistics. The resulting word vectors also perform decently on NLP-related tasks.