### Readme

This repository contains supplementary materials for the ICLR 2026 paper, "Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale."

#### Overview
This supplementary material provides key components of our data generation and verification pipeline: the prompt template, the verification script, and examples of our grounded data. The goal is to make the core assets used in our work easy to inspect and reproduce.

#### Folder Structure
.
├── bonding-box-examples/
│   ├── train_00081_visualization.html
│   └── train_00644_visualization.html
├── data-example/
└── prompt-template/
    ├── caption_to_mcq_with_bbox_v2_object_centric.jinja2
    └── qwen3_verifier.py

#### Key Components
1. prompt-template/
This directory contains the core templates and scripts for our data synthesis pipeline.

caption_to_mcq_with_bbox_v2_object_centric.jinja2: This is the Jinja2 prompt template used in Stage 1 of our pipeline. It takes a dense caption and object metadata (including bounding boxes) as input to guide a Large Language Model in generating object-centric, grounded Multiple-Choice Questions (MCQs).
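
To illustrate how a template of this kind is typically filled in, here is a minimal sketch that renders a prompt from a caption and object metadata with the `jinja2` package. The inline template text, the variable names (`caption`, `objects`, `num_questions`), and the `[x_min, y_min, x_max, y_max]` pixel box format are all assumptions made for illustration; the variables expected by the actual `.jinja2` file may differ, so check the file before adapting this.

```python
from jinja2 import Environment, StrictUndefined

# Hypothetical stand-in for the real template file. The variable names
# below are assumptions, not the ones used in the released template.
TEMPLATE_TEXT = """\
Caption: {{ caption }}
Objects:
{% for obj in objects %}
- {{ obj.label }} at [{{ obj.bbox | join(", ") }}]
{% endfor %}
Write {{ num_questions }} multiple-choice questions grounded in the objects above.
"""

# StrictUndefined makes rendering fail loudly if a variable is missing,
# instead of silently inserting an empty string into the prompt.
env = Environment(undefined=StrictUndefined, trim_blocks=True, lstrip_blocks=True)
template = env.from_string(TEMPLATE_TEXT)

# Assumed box format: [x_min, y_min, x_max, y_max] in pixels.
prompt = template.render(
    caption="A dog chases a red ball across a lawn.",
    objects=[
        {"label": "dog", "bbox": [10, 20, 110, 220]},
        {"label": "ball", "bbox": [150, 180, 190, 220]},
    ],
    num_questions=2,
)
print(prompt)
```

To render the released file instead of the inline string, a `jinja2.FileSystemLoader` pointed at `prompt-template/` together with `env.get_template(...)` would be the usual approach, once the template's real variable names are known.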

qwen3_verifier.py: This Python script implements the verification step of our pipeline. It uses the Qwen3 model to check each generated MCQ and its reasoning trace for logical and factual consistency, and serves as the quality-control step for the synthesized data.
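
The general shape of such a verification step can be sketched as a filter around a judge model. This is not the code in `qwen3_verifier.py`: the `Sample` fields, the prompt wording, the `VERDICT:` reply convention, and the `stub_judge` function (a placeholder for an actual Qwen3 call) are hypothetical and exist only to make the example runnable.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass
class Sample:
    # Hypothetical schema; the released data may use different fields.
    question: str
    options: list[str]
    answer: str
    reasoning: str


def build_verifier_prompt(s: Sample) -> str:
    """Format one sample into a prompt for the judge model."""
    return (
        f"Question: {s.question}\n"
        f"Options: {' | '.join(s.options)}\n"
        f"Labeled answer: {s.answer}\n"
        f"Reasoning: {s.reasoning}\n"
        "Reply with a short justification, then a final line "
        "'VERDICT: PASS' or 'VERDICT: FAIL'."
    )


def parse_verdict(reply: str) -> bool:
    """Accept a sample only if the last non-empty line is 'VERDICT: PASS'."""
    lines = [ln.strip() for ln in reply.splitlines() if ln.strip()]
    return bool(lines) and lines[-1].upper() == "VERDICT: PASS"


def filter_samples(samples: Iterable[Sample], judge: Callable[[str], str]) -> list[Sample]:
    """Keep the samples that the judge marks as consistent."""
    return [s for s in samples if parse_verdict(judge(build_verifier_prompt(s)))]


def stub_judge(prompt: str) -> str:
    # Placeholder for a real model call: passes a sample only when the
    # reasoning text ends up at the labeled answer.
    label = prompt.split("Labeled answer: ")[1].split("\n")[0]
    if f"answer is {label}" in prompt:
        return "The trace is consistent.\nVERDICT: PASS"
    return "The trace contradicts the label.\nVERDICT: FAIL"


samples = [
    Sample("What color is the ball?", ["A. blue", "B. red"], "B",
           "The ball in region [150, 180, 190, 220] is red, so the answer is B."),
    Sample("Where is the dog?", ["A. lawn", "B. road", "C. beach"], "A",
           "The caption mentions sand, so the answer is C."),
]
kept = filter_samples(samples, stub_judge)
print(len(kept), "of", len(samples), "samples kept")
```

Passing the judge in as a callable keeps the filtering logic independent of how the model is served, so the stub can be swapped for a real inference call without touching the rest.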

2. bonding-box-examples/
This directory provides HTML visualizations that demonstrate how our grounding mechanism works. Each file shows an image from our dataset overlaid with the bounding boxes generated by Grounded SAM 2. These visualizations are useful for qualitatively assessing the accuracy of the object metadata that serves as the foundation for our question generation.
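
For readers who want to build a similar overlay for their own images, the following is a minimal sketch using only the Python standard library. It is not how the provided HTML files were produced. The function name `render_overlay`, the image path, and the `(x_min, y_min, x_max, y_max)` pixel box format are assumptions for illustration.

```python
from html import escape


def render_overlay(image_src, width, height, boxes):
    """Return an HTML snippet that draws labeled boxes over an image.

    boxes: list of (label, (x_min, y_min, x_max, y_max)) in pixels
    (an assumed format, not necessarily the one used in the dataset).
    """
    parts = [
        f'<div style="position:relative;width:{width}px;height:{height}px">',
        f'<img src="{escape(image_src, quote=True)}" width="{width}" height="{height}">',
    ]
    for label, (x0, y0, x1, y1) in boxes:
        # Each box is an absolutely positioned, bordered div on top of the image.
        style = (
            f"position:absolute;left:{x0}px;top:{y0}px;"
            f"width:{x1 - x0}px;height:{y1 - y0}px;"
            "border:2px solid red;box-sizing:border-box"
        )
        parts.append(f'<div style="{style}" title="{escape(label, quote=True)}"></div>')
    parts.append("</div>")
    return "\n".join(parts)


page = render_overlay(
    "example.jpg",  # hypothetical image path
    640,
    480,
    [("dog", (10, 20, 110, 220)), ("ball", (150, 180, 190, 220))],
)
print(page)
```

Hovering over a box in the rendered page shows its label through the `title` attribute, which is a quick way to check whether a label matches the region it covers.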

3. data-example/
This directory contains a small, representative sample of the final synthesized data, including the generated MCQs and their corresponding long-form reasoning traces (Long Grounded Thoughts).

#### Usage
These components illustrate the main stages of our data generation framework as described in the main paper. The .jinja2 template can be used to replicate the MCQ generation process (Stage 1), while the verification script demonstrates a key part of our quality control pipeline. The examples show the final output of our method.