# Step 1: Process CSV files

We first process the data so that we only use binary features and so the data is stored as a numpy array.

Run `python process_csv.py` with the appropriate arguments.

# Step 2: Run the practioner's validation process experiment

We next simulate the experiment described in Figure 1 and Figure 7.

Run `python run_experiment.py` with the appropriate arguments for each dataset. Make sure that the data directory is the same as the output folder of Step 1. 

The output directory will contain the saved results of the experiment as well as graphs of the results.

# Step 3: Graph the aggregate results

Run `python graph_results.py` to plot the results of multiple experiments together in the same plot for comparison.
