Inverted indexing for cross-lingual NLP

2015 (modified: 16 Jul 2019), ACL (1) 2015, Readers: Everyone
Abstract: We present a novel, count-based approach to obtaining inter-lingual word representations based on inverted indexing of Wikipedia. We report experiments applying these representations to 17 datasets in document classification, POS tagging, dependency parsing, and word alignment. Our approach has the advantage that it is simple, computationally efficient, and almost parameter-free, and, more importantly, it enables multi-source cross-lingual learning. In 14 of 17 cases, we improve over using state-of-the-art bilingual embeddings.
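The abstract does not spell out the construction, so the following is only a minimal sketch of one plausible reading of "inverted indexing": each word is represented by counts over the language-independent concepts (for example, Wikipedia articles linked across languages) in whose text it occurs. The toy corpus, the concept IDs `C1` and `C2`, and the helper names `build_inverted_index` and `to_vector` are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict

# Hypothetical toy data: concept ID -> language -> article text.
# A real setup would presumably use Wikipedia articles aligned across languages.
concept_docs = {
    "C1": {"en": "the dog barks", "de": "der Hund bellt"},
    "C2": {"en": "the cat sleeps", "de": "die Katze schläft"},
}


def build_inverted_index(docs):
    """Map (language, word) to a Counter of the concept IDs the word occurs in."""
    index = defaultdict(Counter)
    for concept_id, by_lang in docs.items():
        for lang, text in by_lang.items():
            # Whitespace tokenization and lowercasing are simplifying assumptions.
            for token in text.lower().split():
                index[(lang, token)][concept_id] += 1
    return index


def to_vector(index, lang, word, concept_ids):
    """Return the count vector of a word over a fixed ordering of concepts."""
    counts = index.get((lang, word.lower()), Counter())
    return [counts[c] for c in concept_ids]


index = build_inverted_index(concept_docs)
concept_ids = sorted(concept_docs)

# Words from different languages live in the same concept space.
en_dog = to_vector(index, "en", "dog", concept_ids)
de_hund = to_vector(index, "de", "Hund", concept_ids)
```

In this sketch, words from any number of languages share one vector space because the dimensions are concepts rather than words, which is one way a representation could support the multi-source setting the abstract mentions. No dimensionality reduction or weighting is applied here.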