
Official Codebase of RepoNavigator

This codebase is built upon verl.


```shell
conda create -n navigator python=3.11
conda activate navigator
pip install --no-deps -e .

```


The repos provided by SWE-bench are stored in dockerhub. To enable efficient training, we pull them all before training starts.
```shell
python scripts/download_repos.py
```
The downloaded repos (```./repos_smith_gold```, ```./repos_verified```, and ```./repos_pro```) will be stored in the root directory.


Before training, you need to organize the datasets into parquet files by runing the following commands:
```shell
python preprocess/create_dataset_swe_pro_function_filter.py
python preprocess/create_dataset_swe_smith_function_filter.py
python preprocess/create_dataset_swe_verified_function_filter.py
```
⚠️ The full repos are about 30GB, please make sure you have enough space on your device.

Now, please input your WANDB_API_KEY to visualize your training:
```shell
export WANDB_API_KEY='your-wandb-api-key'
```

We provide the training scripts in ```./training ```:
```shell
bash run_xxxxxxxxx.sh
```
Noted, if you want to run the 14B and 32B training, you are suggested to run on at least 2 nodes with 16 80G GPUs in total. Before running the scripts, you must start the ray cluster. For the 7B model, 8 80G gpus are enough, and you can run the script directly.

For evaluation, you can directly run the training scripts while changing "val_only" to True.
The results will be saved in ``` ./results ```, you can refer to ```scripts/evaluate_jsonl.py``` to calculated the metrics.



Due to the size limit of github, we can not provide all the files, and you can download them using the provided scipts.


We will release our checkpoints when our paper is accepted.


Yes, you can achieve it by runing ```scripts/evaluate_jsonl.py```, you can also modify this part to calculate more metrices.


If you face any issues, you can refer to the official repository of verl.
