# SceneNAT: Masked Generative Modeling for Language-Guided Indoor Scene Synthesis

This repository contains the implementation of SceneNAT, a novel approach for language-guided 3D indoor scene generation using non-autoregressive transformers.

## Installation

### Prerequisites
- Python 3.8+
- CUDA 11.8+
- Conda

### Environment Setup
1.  **Create Conda Environment:**
    ```bash
    conda create -n scenenat --file settings/packages.txt
    conda activate scenenat
    ```

2.  **Install PyTorch:**
    ```bash
    python -m pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
    ```

3.  **Install Dependencies:**
    ```bash
    python -m pip install -r settings/requirements.txt
    python -c "import nltk; nltk.download('cmudict')"
    python -m pip install openai-clip open-clip-torch matplotlib
    ```

4.  **Download the Blender software for visualization.**
    ```bash
    cd blender
    wget https://download.blender.org/release/Blender3.3/blender-3.3.1-linux-x64.tar.xz
    tar -xvf blender-3.3.1-linux-x64.tar.xz
    rm blender-3.3.1-linux-x64.tar.xz
    ```

## Dataset

The 3D-FRONT dataset should be preprocessed and placed in the `dataset/` directory. Please refer to the provided preprocessing scripts for details.

## Training

To train the SceneNAT model, use the provided script for distributed training on multiple GPUs.

1.  **Configure Training:**
    Open `scripts/train_ddp.sh` and set the following key variables:
    -   `roomtype`: The room type to train on (`bed`, `dining`, or `living`). This determines which configuration from the `configs/` directory is used (e.g., `bed` uses `configs/bed.yaml`).
    -   `exp_name`: A unique name for your experiment.
    -   `model`: The model version to train (e.g., `baseline`).
    -   `CUDA_VISIBLE_DEVICES`: The GPU IDs to use.

2.  **Start Training:**
    ```bash
    bash scripts/train_ddp.sh
    ```
    The script will handle the multi-GPU setup and save checkpoints to the `output/` directory.

## Evaluation

**Usage:**
```bash
bash scripts/eval.sh <model_version> <room_type>
```
```

## Output and Visualization

All training and evaluation outputs, including checkpoints, generated scenes, and visualizations, are saved to the `output/` directory.

The directory is structured as follows:
```
output/
└── scenenat/
    └── <room_type>/
        ├── checkpoints/
        ├── generated_scenes/
        └── ...
```

Visualizations of the generated scenes are automatically created by the evaluation script, as it includes the `--visualize` flag by default.
