This directory contains all the Python pipelines related to preprocessing of prompts and postprocessing of outputs.

`generate_boxes_data_modified.py`: Modified data generation code with following changes:
* Initial State for all boxes is 2 objects.
* No "Remove" Operation. (Only Move/Put)
* `numops` up to 5 operations. 
* Added new fields: `initial_state`, `used_objects`, `used_boxes` (UNFINISHED)

The exact command for the current dataset is generated with is the following:

```commandline
generate_boxes_data_modified.py\
    --object_vocabulary_file "../data/objects/llama_friendly_objects.csv"\
    --output_dir "../data/boxes_noRemove_noEmpty_max5"\
    --alternative_forms "always"\
    --cft_formats "rand_obj_rand_query_id"\
    --exp_formats "all_enum"\
    --prompt_formats "instruct_pre_exp_1s"
```

- `alternative_forms` (str): specifies which split uses alternative form (reverses state description "Box 0 contains apple" to "Apple is in Box 0"). One of {train, test, always, never}
- `cft_formats` (str): comma separated list of counterfactuals to include. For each counterfactual format, you can choose from {`rand_obj`, `rand_query_id`, `rand_box_id`, `rand_obj_rand_query_id`, `rand_obj_rand_box_id`}. `rand_obj` means randomly map existing objects to a new set of unseen objects. `rand_box_id` means randomly shuffle all box ids (including query box id). `rand_query_id` means randomly map the query box id only. For example, you can specify it as `rand_query_id,rand_box_id`. This will output two counterfactuals per data point, one swapping query box id, one swapping all box ids. Couterfactuals will also be applied to prompts as well, so for every prompt, you get twice the amount of counterfactuals in this example.
- `exp_formats` (str): comma separated list of explanation format to include. For each explanation format, you can choose from {`all_list`, `query_list`, `all_enum`, `query_enum`, `all_text`, `query_text`}. `all` means at every state the description includes all boxes, whereas `query` only describes the query box state. `text` describes the states in natural language (similar to initial state description). `enum` enumerate line by line the states. `list` prints out a python like list object. 
- `prompt_formats` (str): comma separated list of prompt format to include. For each prompt format, the schema is `{instruct_}{no/pre/post}_exp{_1s}{_completed}`. `instruct_` indicates whether instruction string is prefixed to the prompt. `no/pre/post_exp` indicates whether the explanation is included, before, or after the prediction. `1s` indicates whether the 1-shot demonstration is included. `completed` means whether ground-truth explanation and prediction is filled in (you want this for circuit tracing but not for generation eval). The total number of prompt is multiplied by the number of explanation format you provide (e.g. if you have `all_list,query_enum` for `exp_formats` and `pre_exp_1s,pre_exp_1s_completed`, you will end up with 4 prompts in the end)
