# SC^2-WM

SC^2-WM for Vision-and-Language Navigation in Continuous Environments

## Introduction

## Project Structure

```
SC2-WM/
├── .gitignore
├── README.md
├── environment.yaml
├── run.py
├── bert_config/
│   └── bert-base-uncased/
│       └── ...
├── data/
│   ├── ViT-B-16.pt                    # Pretrained CLIP model (download required)
│   └── checkpoints/
│       └── ckpt.46600.pth             # Model checkpoint (download required)
├── pretrained/                         # Pretrained models directory
│   └── ...
├── habitat_extensions/
│   └── ...
├── run_r2r/
│   ├── main.bash                      # Main training/evaluation script for R2R
│   ├── iter_train.yaml
│   └── r2r_vlnce.yaml
├── utils_p/
│   └── ...
└── vlnce_baselines/
    └── ...
```

## Usage

### Prerequisites

1. **Environment Setup**

   First, create the conda environment and install PyTorch and basic dependencies using `environment.yaml`.

   ```bash
   conda env create -f environment.yaml
   conda activate sc2-wm
   ```

2. **Install Habitat**

   Follow the Habitat Installation Guide and VLN-CE to install habitat-lab.  
   (Note: habitat-sim v0.2.1 is already included in our environment.yaml. Please ensure habitat-lab version is compatible).

3. **Install Extension Modules**

   Install `torch_kdtree` and `tinycudann` inside the activated environment.

   - Install `torch_kdtree` for K-nearest feature search:

     ```bash
     git clone https://github.com/thomgrand/torch_kdtree
     cd torch_kdtree
     git submodule init
     git submodule update
     pip3 install .
     cd ..
     ```

   - Install `tinycudann` for faster multi-layer perceptrons (MLPs):

     ```bash
     pip3 install "git+https://github.com/NVlabs/tiny-cuda-nn/#subdirectory=bindings/torch"
     ```

4. **Download Data**

   Please download the pretrained models and checkpoints from [Google Drive](https://drive.google.com/drive/folders/1hP8oIbn6bcDA3ZNLJLV0hSAPqgFvurxl?usp=drive_link).


## Online Evaluation

Use pseudo interactive demonstrator to equip the model with our SC^2-WM:

```bash
bash run_r2r/main.bash eval
```

### Notice

Our codes are uploaded only for peer review, please do not distribute them.  
The code is used to reproduce our experimental results on R2R-CE dataset.  
The complete code will be released if the paper is accepted.