**Hardware requirement**: ≥8 H20 (or equivalent) GPUs for full-scale reproduction.

All relevant files are located under `./IntentRL/`:
- Workflow & prompts: `./IntentRL/workflow/`
- Training config: `./IntentRL/train.yaml`
- Data preparation scripts: `./IntentRL/data_prepare/`

## Prerequisites
- Python 3.10 or higher
- OpenAI APi key
- Gemini API key

## Installation
Install Trinity Framework and clone the IntentRL repository in `./examples` direction:
```bash
# Install from source
cd {PATH_TO_TRINITY}
pip install -e .
```

## Step 1. Prepare Datasets
Download the [DeepResearch Bench](https://deepresearch-bench.github.io/#)

Perform the following preprocessing steps to turn the samples in DeepResearch Bench into the format for our training:
```bash
cd ./examples/IntentRL
python data_prepare/1_generate_tree.py
python data_prepare/2_generate_trajectory.py
python data_prepare/3_generate_summary.py
python data_prepare/4_data_splt.py
python data_prepare/5_info_extract_pipeline.py
python data_prepare/6_build_dataset.py
```

## Step 2. Start Training
Update `train.yaml` with paths to:
- Your processed datasets,
- Base model,
- Checkpoint output directory,
- Judge model.

Then, launch training:
```bash
sh train.sh
```

## Step 3. Simulation
```bash
python clarify_simulate.py
```