Ruminating Word Representations with Random Noise Masking

28 Sept 2020 (modified: 05 May 2023), ICLR 2021 Conference Blind Submission
Keywords: representation learning for natural language processing, pretrained word embeddings, iterative training method, model regularization
Abstract: We introduce a training method for better word representations and performance, which we call \textbf{GraVeR} (\textbf{Gra}dual \textbf{Ve}ctor \textbf{R}umination). The method gradually and iteratively adds random noise and bias to word embeddings after training a model, and then re-trains the model from scratch, initialized with the noised word embeddings. Through the re-training process, some of the noise is compensated for and the rest is utilized to learn better representations. As a result, the word representations become further fine-tuned and specialized to the task. On six text classification tasks, our method improves model performance by a large margin. When GraVeR is combined with other regularization techniques, it shows further improvements. Lastly, we investigate the usefulness of GraVeR.
One-sentence Summary: An iterative method applied to pretrained word embeddings to find better word representations.
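As a rough illustration of the noise-and-retrain loop described in the abstract, the sketch below shows one way such a procedure could be wired up in Python/NumPy. The helper name `train_model`, the Gaussian noise, the masking ratio, and the fixed noise scale are assumptions made for illustration; they are not the paper's exact algorithm or schedule.

```python
import numpy as np

def iterative_noise_retrain(embeddings, train_model, num_iterations=5,
                            noise_std=0.01, mask_ratio=0.2, rng=None):
    """Minimal sketch of a GraVeR-style loop (assumed interface):
    `train_model(emb)` is a hypothetical function that trains a classifier
    from scratch using `emb` as the initial embedding matrix and returns
    (trained_embeddings, validation_score)."""
    rng = rng or np.random.default_rng(0)
    best_embeddings, best_score = embeddings, float("-inf")

    for _ in range(num_iterations):
        # Re-train from scratch, initialized with the (possibly noised) embeddings.
        tuned_embeddings, score = train_model(embeddings)
        if score > best_score:
            best_embeddings, best_score = tuned_embeddings, score

        # Add random noise to a randomly masked subset of word vectors,
        # and use the noised matrix to initialize the next re-training run.
        mask = rng.random(tuned_embeddings.shape[0]) < mask_ratio
        noise = rng.normal(0.0, noise_std, size=tuned_embeddings.shape)
        embeddings = tuned_embeddings.copy()
        embeddings[mask] += noise[mask]

    return best_embeddings, best_score
```

In this sketch, keeping the best-scoring embeddings across iterations stands in for the "gradual" aspect: noise that hurts performance is discarded at the next selection, while noise that helps is retained in later initializations.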
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics
Supplementary Material: zip
Reviewed Version (pdf): https://openreview.net/references/pdf?id=Z5gwxaHZCo