# Creating the dataset

Perform the following steps to build the dataset:
1. Run `python process_agiga.py <path-to-gigaword-directory> <path-to-output-dir>` to preprocess the Gigaword dataset into the right format.
2. `cd` to the output directory and run `remove_empty.py` three times, once each with p = "train", p = "dev", and p = "test".
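
The two steps above could be scripted roughly as follows. This is only a sketch under stated assumptions, not part of the repository: the helper names (`preprocess_command`, `remove_empty_commands`, `build_dataset`) are hypothetical, and it assumes that `p` is passed to `remove_empty.py` as a positional command-line argument and that the script is reachable from the output directory. If `p` is instead a variable set inside `remove_empty.py`, edit the script for each split and drop the argument.

```python
import subprocess
import sys
from pathlib import Path

# The three values of p named in step 2.
SPLITS = ("train", "dev", "test")


def preprocess_command(gigaword_dir, output_dir):
    """Build the step 1 command with the two paths given in the steps above."""
    return [sys.executable, "process_agiga.py", str(gigaword_dir), str(output_dir)]


def remove_empty_commands():
    """Build one step 2 command per split.

    ASSUMPTION: p is a positional argument of remove_empty.py.
    The README does not say how p is supplied.
    """
    return [[sys.executable, "remove_empty.py", split] for split in SPLITS]


def build_dataset(gigaword_dir, output_dir):
    """Run step 1, then step 2 for each split, stopping at the first failure."""
    subprocess.run(preprocess_command(gigaword_dir, output_dir), check=True)
    for command in remove_empty_commands():
        # Step 2 runs from inside the output directory (the `cd` in the steps).
        # ASSUMPTION: remove_empty.py can be found from that directory.
        subprocess.run(command, cwd=Path(output_dir), check=True)
```

Calling `build_dataset("<path-to-gigaword-directory>", "<path-to-output-dir>")` with real paths would run everything in order; `check=True` makes a failing step raise instead of silently continuing.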
