## Dependency
- Python 3.9
- openai
- tree_sitter
- codebleu
  
## Dataset

We store the CoNaLa dataset in the `conala` folder. The dataset is stored in the `json` format. Each entry in the dataset is a dictionary.

## Empirical Study

Before running the study, you need to fill in the Openai API key in the line 28 of `generator_llm.py`.

## Generating ERs in the open setting

To generate ERs in the open setting, run the following command:

```bash
python generator_llm.py \
    --dataset conala \
    --model gpt-4o \
    --task open_domain
```

## Generating ERs in the constrained setting

To generate ERs in the constrained setting, run the following command:

```bash
python generator_llm.py \
    --dataset conala \
    --model gpt-4o \
    --task code2comment # or code2pseudocode, code2flow
```