# FloorplanQA

A benchmark dataset and evaluation framework for testing Large Language Models (LLMs) on spatial reasoning tasks using 2D room layouts. The system generates question-answer pairs from room floorplans that require geometric computations, pathfinding, and spatial understanding.

## Overview

FloorplanQA provides a comprehensive suite of spatial reasoning tasks:

| Task | Description | Answer Type |
|------|-------------|-------------|
| **Pair Distance** | Calculate Euclidean distance between object centroids | Float (meters) |
| **Free Space** | Compute total unoccupied floor area | Float (m²) |
| **Max Box** | Find largest rectangle that fits in free space | Float (m²) |
| **Obstruction** | Identify objects intersecting a line between two objects | List of labels |
| **Shortest Path** | Find shortest path between objects avoiding obstacles | List of (x,y) waypoints |
| **View Angle** | Compute angle between object-to-object vector and north | Float (degrees) |
| **Placement** | Check if a new object can fit in the room | Boolean |
| **Repositioning** | Calculate max distance an object can slide in a direction | Float (meters) |

## Installation

### Requirements

- Python 3.10+
- Dependencies:

```bash
pip install numpy pandas shapely matplotlib python-fire google-generativeai python-dotenv
```

### Environment Setup

For layout generation with Gemini:

```bash
# Create .env file in project root
echo "GEMINI_API_KEY=your_api_key_here" > .env
```

## Usage

### Generating QA Pairs

All scripts use Python Fire CLI. Run from the repository root:

```bash
# Generate QA pairs for specific tasks
python scripts/run_placement.py --input_dir=data/hssd_data/new_format
python scripts/run_shortest_path.py
python scripts/run_free_space.py
python scripts/run_obstruction.py
python scripts/run_view_angle.py
python scripts/run_max_box.py
python scripts/run_pair_distance.py
python scripts/run_repositioning.py
```

### Common Arguments

| Argument | Default | Description |
|----------|---------|-------------|
| `--input_dir` | `data/hssd_data/new_format` | Directory containing room JSON files |
| `--output_csv` | `benchmark/{task}/{task}_qa_hssd_data.csv` | Output CSV path |
| `--output_img` | `benchmark/{task}/{task}_qa_hssd_images/` | Output image directory |
| `--enable_subsampling` | `False` | Enable room type subsampling |
| `--bedrooms_count` | `80` | Max bedroom samples when subsampling |
| `--living_rooms_count` | `80` | Max living room samples when subsampling |
| `--kitchens_count` | `80` | Max kitchen samples when subsampling |

### Example: Generate placement QA pairs with subsampling

```bash
python scripts/run_placement.py \
    --input_dir=data/hssd_data/new_format \
    --enable_subsampling=True \
    --kitchens_count=40 \
    --living_rooms_count=40 \
    --bedrooms_count=40
```

### Preparing Data for LLM Evaluation

```bash
# Generate JSONL batch files for LLM inference
python scripts/run_preparation.py
python scripts/run_preparation_tools.py
python scripts/run_preparation_images.py
```

## Project Structure

```
FloorplanQA/
├── src/
│   ├── qa_pairs_generation/          # Core QA generation modules
│   │   ├── utils.py                  # Shared utilities (1600+ lines)
│   │   ├── placement/                # Object placement analysis
│   │   │   ├── placement.py          # Main placement logic
│   │   │   └── objects.py            # Object dimension definitions
│   │   ├── shortest_path/            # A* pathfinding
│   │   │   ├── shortest_path.py      # Path generation
│   │   │   └── pathfinder.py         # Visibility graph + Dijkstra
│   │   ├── obstruction/              # Line-of-sight obstruction detection
│   │   ├── view_angle/               # Angle calculations
│   │   ├── repositioning/            # Object sliding distance
│   │   ├── free_space/               # Unoccupied area computation
│   │   ├── max_box/                  # Largest empty rectangle
│   │   └── pair_distance/            # Object-to-object distances
│   ├── llm_eval/                     # LLM evaluation pipeline
│   │   ├── preparation.py            # Batch JSONL generation
│   │   └── questions.py              # Prompt templates
│   └── aggregation/                  # Result aggregation notebooks
├── layout_generation/                # Gemini-based room generation
│   ├── utils.py                      # Visualization utilities
│   ├── kitchen_generation/
│   ├── living_room_generation/
│   └── bedroom_generation/
├── layout_generation_prompts/        # Prompt engineering for layout gen
│   ├── kitchen_prompts.py
│   ├── living_room_prompts.py
│   └── bedroom_prompts.py
├── scripts/                          # CLI entry points
│   ├── run_placement.py
│   ├── run_shortest_path.py
│   ├── run_obstruction.py
│   ├── run_view_angle.py
│   ├── run_repositioning.py
│   ├── run_free_space.py
│   ├── run_max_box.py
│   ├── run_pair_distance.py
│   └── run_preparation*.py
├── data/                             # Input data directory
│   └── hssd_data/
│       └── new_format/               # Room JSON files by type
│           ├── kitchens/
│           ├── living_rooms/
│           └── bedrooms/
├── benchmark/                        # Generated outputs
└── other_formats/                    # Alternative data representations
```

## Data Format

### Room JSON Structure

```json
{
  "layout_id": "102344022_room_0",
  "room_type": "kitchen",
  "room": {
    "width": 8.5,
    "depth": 6.2,
    "units": "meters"
  },
  "room_boundary": [
    {"x": 0.0, "y": 0.0},
    {"x": 8.5, "y": 0.0},
    {"x": 8.5, "y": 6.2},
    {"x": 0.0, "y": 6.2}
  ],
  "walls": [
    {"start": {"x": 0, "y": 0}, "end": {"x": 8.5, "y": 0}}
  ],
  "objects": [
    {
      "label": "table",
      "bbox": [1.0, 2.0, 2.5, 4.0],
      "points": [
        {"x": 2.0, "y": 1.0},
        {"x": 4.0, "y": 1.0},
        {"x": 4.0, "y": 2.5},
        {"x": 2.0, "y": 2.5}
      ]
    }
  ],
  "openings": {
    "windows": [{"label": "window", "points": [...]}],
    "doors": [{"label": "door", "points": [...]}]
  }
}
```

**Important**: Bounding boxes use `[y0, x0, y1, x1]` format (y before x).

### Output CSV Structure

Generated CSV files contain:

| Column | Description |
|--------|-------------|
| `layout_id` | Unique room identifier |
| `room_type` | kitchen / living_room / bedroom |
| `question` | Generated question text |
| `answer` | Ground truth answer |
| Task-specific columns | e.g., `object_1`, `object_2`, `distance`, etc. |

## Supported Room Types

- **Kitchens** - Appliance-focused layouts (stove, fridge, sink, etc.)
- **Living Rooms** - Furniture-focused layouts (sofa, TV, coffee table, etc.)
- **Bedrooms** - Personal space layouts (bed, dresser, wardrobe, etc.)

## Special Handling

### Excluded from Obstacle Calculations
- **Ceiling fixtures**: lights, chandeliers, fans, pendants
- **Floor coverings**: rugs, carpets, mats (objects may overlap these)

### Pathfinding
- Uses visibility graph with configurable clearance (default: 0.1m)
- Dijkstra's algorithm for shortest path computation
- Room corners and object vertices serve as graph nodes

## LLM Evaluation

The `src/llm_eval/` module supports:

- **Batch JSONL generation** for OpenAI/Gemini batch APIs
- **Label swapping ablations** (rotate, reverse, shuffle strategies)
- **XML format conversion** for format ablation studies
- **Multiple model configurations** (GPT-4, Gemini 2.5, Claude, etc.)

### Prompt Templates

Each task has a structured prompt in `src/llm_eval/questions.py` that:
1. Describes the task and room layout format
2. Requests step-by-step reasoning
3. Specifies exact output format (e.g., `*Final answer*: 5.610`)
