# SkyRL-v0

# Evaluation
We report evaluation results of different downstream tasks as below. 

## SWE-Bench
We report the evaluation result on SWE-Bench-Verified below.

| Model              | Base                 | Base Performance | Performance | Training Time |
|--------------------|----------------------|------------------|-------------|---------------|
| SkyRL-Agent-7B-v0  | OpenHands-7B-Agent   | 11%              | 14.6%       | 16hrs 8xH100  |
| SkyRL-Agent-8B-v0  | Qwen3-8B no thinking | 3.6%             | 9.4%        | 27hrs 8xH200  |
| SkyRL-Agent-14B-v0 | Qwen3-14B thinking   | 18%              | 21.6%       | 20hrs 8xH200  |


# Reproduction Scripts

We are actively working on providing updated scripts for SkyRL-v0. Stay tuned! ⚠️⚠️