Abstract: We provide a lexicon for text normalization of Indonesian colloquial words. We gathered 3,592 unique colloquial words-also known as “bahasa alay” -and manually annotated them with the normalized form. We built this lexicon from Instagram comments provided in [1].
0 Replies
Loading