# Adversarial Test Case Generation via Reinforcement Learning Extends Scaling Laws

## Overview

This repository contains the code for ICLR 2026 submission 12230, "Adversarial Test Case Generation via Reinforcement Learning Extends Scaling Laws".


## Setup

```bash
conda create -n tcs python=3.10
conda activate tcs
pip install -e .
pip install -r requirements.txt
# Set MAX_JOBS as an environment variable for the pip process so the flash-attn build can see it
MAX_JOBS=$(nproc) pip install --use-pep517 flash-attn --no-build-isolation
```

## Training

```bash
bash scripts/deepseek-r1-code.sh --train-type dynamic_test_case --model-path deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B --train-files /path/to/file/taco_for_code_generation.pkl
```

* Download the training dataset from [TACO_Train](https://huggingface.co/datasets/iclr12230/TACO_Train) and pass the path of the downloaded file via `--train-files` (or set it in the script).

* For stage-two training, set `--model-path` to the checkpoint produced by stage-one training and `--train-type` to `adversarial_test_case`.
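
A stage-two launch could look like the sketch below. It reuses the launcher and flags from the stage-one command above; `STAGE_ONE_CKPT` and `TRAIN_FILE` are placeholder paths (assumptions, not paths shipped with this repository), and the existence check is only there so the snippet fails gracefully when run outside the repository root.

```bash
# Placeholder paths (assumptions): replace them with your own locations.
STAGE_ONE_CKPT="/path/to/stage_one_checkpoint"            # hypothetical stage-one output
TRAIN_FILE="/path/to/file/taco_for_code_generation.pkl"   # downloaded TACO_Train file
TRAIN_TYPE="adversarial_test_case"                        # stage-two training type

LAUNCHER="scripts/deepseek-r1-code.sh"
if [ -f "$LAUNCHER" ]; then
  # Same launcher as stage one, with the stage-one checkpoint as the starting model.
  bash "$LAUNCHER" \
    --train-type "$TRAIN_TYPE" \
    --model-path "$STAGE_ONE_CKPT" \
    --train-files "$TRAIN_FILE"
else
  echo "Run this from the repository root: $LAUNCHER not found" >&2
fi
```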

## Inference

`scripts/test_case_generation_trigger.sh` is the trigger script that computes all the metrics reported in the paper (pass@1, TCS, and reward model).

```bash
bash scripts/test_case_generation_trigger.sh
```

* Download the following files, then specify their paths in the script:
  * the TACO evaluation file in the required format from [TACO_Eval](https://huggingface.co/datasets/iclr12230/TACO_Eval)
  * the LiveCodeBench evaluation file in the required format from [LiveCodeBench](https://huggingface.co/datasets/iclr12230/LiveCodeBench_Eval)
  * the corresponding test cases from [livecodebench/code_generation_lite](https://huggingface.co/datasets/livecodebench/code_generation_lite)

* For convenience, the TCS fine-tuned models are available at [iclr12230/TCS_1.5B](https://huggingface.co/iclr12230/TCS_1.5B/tree/main) (1.5B) and [iclr12230/TCS_7B](https://huggingface.co/iclr12230/TCS_7B/tree/main) (7B).
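
One way to fetch the evaluation data listed above is the Hugging Face command-line tool, sketched below. This assumes the `huggingface_hub` package is installed and provides `huggingface-cli` (recent versions may expose the same subcommand as `hf download`); `DATA_DIR` is a hypothetical local directory, and the script paths still need to be updated to match it afterwards.

```bash
# Hypothetical download location (assumption): change it to suit your setup.
DATA_DIR="./data"

if command -v huggingface-cli >/dev/null 2>&1; then
  # All three repositories are dataset repos, hence --repo-type dataset.
  for repo in \
    iclr12230/TACO_Eval \
    iclr12230/LiveCodeBench_Eval \
    livecodebench/code_generation_lite
  do
    # Store each repository in its own subdirectory, named after the repo.
    huggingface-cli download "$repo" \
      --repo-type dataset \
      --local-dir "$DATA_DIR/${repo##*/}"
  done
else
  echo "huggingface-cli not found; install huggingface_hub first" >&2
fi
```

The released checkpoints are model repositories, so the same command without `--repo-type dataset` (for example `huggingface-cli download iclr12230/TCS_1.5B`) should fetch them.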



