# 🧑‍💻 SWE Bench
This repository contains the data collection scripts, the validation and evaluation harness, and the experiment results for the SWE-Bench software engineering dataset described in the paper "SWE-bench: Evaluating Language Models at Resolving Real-World GitHub Issues".

Mirror clones of the target repositories are available under the [`swe-bench` organization](https://github.com/orgs/swe-bench/repositories).

## 👋 Overview
SWE-Bench is a benchmark for evaluating large language models on software engineering formulated as a task. Given a large codebase and a problem statement, a language-model-based coding agent must edit the correct files so that the stated problem is resolved. The mode of 

## 🚀 Set Up
To build SWE-Bench from source, follow these steps:
1. Clone this repository locally.
2. `cd` into the repository.
3. Run `conda env create -f environment.yml` to create a conda environment named `swe-bench`.
4. Activate the environment with `conda activate swe-bench`.
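
After completing the steps above, it can be useful to confirm that the `swe-bench` environment is the active one. The following is a minimal sketch, not part of this repository: it assumes conda has set the `CONDA_DEFAULT_ENV` variable on activation, and the helper names `active_conda_env` and `is_setup_active` are hypothetical.

```python
import os

# Environment name created in step 3 of the set up instructions.
EXPECTED_ENV = "swe-bench"


def active_conda_env(environ=None):
    """Return the name of the active conda environment, or None if unset.

    Assumption: `conda activate` exports CONDA_DEFAULT_ENV.
    """
    env = os.environ if environ is None else environ
    return env.get("CONDA_DEFAULT_ENV") or None


def is_setup_active(environ=None, expected=EXPECTED_ENV):
    """Return True when the expected environment is the active one."""
    return active_conda_env(environ) == expected


if __name__ == "__main__":
    name = active_conda_env()
    if is_setup_active():
        print(f"OK: conda environment '{name}' is active")
    else:
        print(
            f"Expected '{EXPECTED_ENV}' but found {name!r}; "
            f"try `conda activate {EXPECTED_ENV}`"
        )
```

Both helpers accept an optional mapping in place of `os.environ`, so the check can be exercised without changing the real shell environment.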