# README for this project

For the dataset, download the C4 `realnewslike` subset:

- c4-train.00000-of-00512.json

Put the downloaded data in the dataset folder, or anywhere else you like, and change the directory in our code that uses it (remember to change the directory in the other scripts as well).
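
As a quick check that the dataset path is set correctly, here is a minimal sketch for reading a shard. It assumes the file is in JSON Lines format (one JSON object per line) with a `text` field, as in the commonly distributed C4 release; this project's own loading code may differ. `DATASET_DIR` and `iter_c4_texts` are hypothetical names used only for illustration.

```python
import json
from pathlib import Path
from typing import Iterator, Optional

# Hypothetical location; point this at wherever you placed the shard.
DATASET_DIR = Path("dataset")


def iter_c4_texts(path: Path, limit: Optional[int] = None) -> Iterator[str]:
    """Yield the "text" field of each record in a JSON Lines shard.

    Assumes one JSON object per line with a "text" key (an assumption
    about the C4 file layout, not something this project confirms).
    """
    count = 0
    with path.open("r", encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            if limit is not None and count >= limit:
                break
            yield json.loads(line)["text"]
            count += 1


if __name__ == "__main__":
    shard = DATASET_DIR / "c4-train.00000-of-00512.json"
    if shard.exists():
        # Print the start of the first few documents as a sanity check.
        for text in iter_c4_texts(shard, limit=3):
            print(text[:80])
    else:
        print(f"Shard not found at {shard}; adjust DATASET_DIR.")
```

If the shard is found and the previews print, the same path can be used when updating the directories in the project scripts.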

To train our network, please refer to `train_selector.py` and `train.py`. It is okay to just run `train.py` after changing the directories of the models and dataset used, since we didn't include them in our zip.

An example of applying and detecting our watermark is in `generate.ipynb`, and you can find the code for our experiments in `detect.py`, `eval_ppl.py`, `eval\paraphrase.py`, `ablation.py`, `pos_tokens.ipynb`, and so on. The results are saved in JSON files, which we also included in this zip.

The code for evaluating the experimental data is in the Python scripts in `eval\`.

