# Evaluation
This section contains scripts for parsing the logs of the evaluation and validation harnesses and determining:
* (For validation) Whether a task instance is usable as an evaluaton instance
* (For evaluation) The performance of a generation relative to the gold patch output.

## Directory Layout