# CAD Agent

Pipeline for text-to-CAD: clarification, misleading-prompt generation, batch labeling, and CadQuery code generation/evaluation. Set `OPENAI_API_KEY` and optional path env vars (`DATA_ROOT`, `TOOLS_DIR`, etc.) as needed.

**Environment:** Use the same Dockerfile as [cadrille](https://github.com/col14m/cadrille).

---

### Main scripts


| **clarification_pipeline_parallel.py** | Full pipeline: detect misleading prompts, generate clarifying questions, answer them, then generate corrected CadQuery code; runs in batches. |
| **batch_label.py** | Batch script to generate LLM labels and evaluate CadQuery code (e.g. Chamfer distance) over train/val/test. |
| **batch_label_agent.py** | Batch generation of misleading CAD descriptions with multiple ambiguity types (K=1,2,3). |
| **batch_label_agent_type2.py** | Same as above but for direct-conflict ambiguity (same feature, two conflicting values). |
| **batch_code_leakage_check.py** | Checks whether modified prompts leak raw CadQuery code from ground truth. |
| **convert_misleading_to_sft.py** | Converts misleading samples into SFT conversation format (questions/answers or clean prompt). |
| **test.py** | Runs CAD code-generation evaluation: sample prompts, generate code, compute metrics (Chamfer, etc.). |

### `config/`

Model and agent settings plus prompt templates: **config.yaml**; **clarification.py**, **code_generation.py**, **code_leakage_check.py** for clarification/code-gen/leakage prompts; **misleading_prompt.py**, **ambiguity_under_specified.py**, **direct_conflict_same_feature_two_values.py** for misleading-prompt types; **prompt_comparison.py**, **prompt_verification.py**, **prompts.py** for comparison and verification.

### `src/`

| File | Purpose |
|------|--------|
| **ask_agent.py** | AskAgent: detects ambiguous/misleading CAD prompts and generates clarifying questions. |
| **inference.py** | LLM wrapper (API and local transformer) used by the pipeline. |
| **evaluation.py** | Metrics (Chamfer distance, etc.) and code extraction for evaluation. |
| **mesh_utils.py** | Load/normalize meshes, point-cloud sampling, CadQuery-to-mesh. |
| **data_loader.py** | Load pickle/JSON data and filter samples. |
| **preprocessing.py** | Build meshes from DeepCAD JSON; used for data prep. |
| **processing.py** | Filter samples by Chamfer criteria and convert to SFT format. |
| **user.py** | User-simulation for prompt generation (expertise levels, controlled errors). |
| **visualization.py** | Mesh rendering (e.g. matplotlib) for headless use. |

### `dataset/`

Data artifacts: **train_val_test.json** (splits), **improved.json** (improved prompts), **selected_misleading_samples_*** (train/test misleading samples).
