Supplementary data package for paper 'Semantically Informed Slang Interpretation'.

This data package contains slang definition entries with example usage sentences used to perform experiments in the paper. It is a subset of the Urban Dictionary dataset released by the paper "Learning to Explain Non-Standard English Words and Phrases" (Ke ni and William Yang Wang, 2017). Each row in the attached csv files include a slang's word form, its slang definition, and a usage context sentence with the slang expression replaced by a '[*SLANGAAAP*]' token. The data is splited into three partitions for training, development, and testing as described in the paper.

All data included are copyrighted by Urban Dictionary (https://www.urbandictionary.com/) and can only be used for research purposes. The original data license from Ni and Wang (2017) is attached below.

___________

[Original Data License from Ni and Wang (2017)]

Ke Ni, and William Yang Wang, "Learning to Explain Non-Standard English Words and Phrases", to appear in Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP 2017), short paper, Taipei, Taiwan, Nov.27-Dec.1, AFNLP.
========================================
Description of tsv train and test file:
Column 1: slang
Column 2: explanation
Column 3: example
Column 4: file id

========================================
The original sources retain the copyright of the data.

Note that there are absolutely no guarantees with this data,
and we provide this dataset "as is",
but you are welcome to report the issues of the preliminary version
of this data.

You are allowed to use this dataset for research purposes only.

For more question about the dataset, please contact:
William Wang, william@cs.ucsb.edu

