# Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search

Our implementation is based on [leaps](https://github.com/Tales-Carvalho/leaps.git) and [reclaiming](https://github.com/lelis-research/prog_policies.git)

## Dependencies

We recommend using `conda` to install the dependencies:

```bash
conda env create --name llm_gs_env --file environment.yml
```

If `conda` is not available, it is also possible to install dependencies using `pip` on **Python 3.8**:

```bash
pip install -r requirements.txt
```

After installing the environment, please export your **openai api key** to execute our main method:
```bash
export OPENAI_KEY="YOUR_API_KEY"
```

## Execution
To execute our main method and baselines. You can change **method** and **task** inside the scripts. **(LLM-GS is our main method.)**

```bash
bash scripts/run_main_results.sh
```

Please notice that the result of LLM-GS might not be exactly the same as the one we report on our paper due to the randomness of the LLMs.
