# DUPS

This software project accompanies the research paper, *DUPS: Dynamic upsampling for efficient semantic segmentation*.

## Introduction

We present **DUPS**, a coarse-to-fine vision transformer for semantic segmentation. 
Unlike models that begin with dense high-resolution tokens, 
DUPS starts at low resolution and dynamically upsamples only regions predicted to contain semantic boundaries, 
following a “one-token-one-class” principle. Mixed-resolution attention enables interaction between coarse and fine tokens, 
allocating computation to semantically complex areas while avoiding redundant processing in homogeneous regions.  
Experiments on ADE20K, COCO-Stuff, and Cityscapes demonstrate that DUPS achieves state-of-the-art results on ADE20K and 
COCO-Stuff with substantially fewer FLOPs, 
and delivers competitive accuracy on Cityscapes at markedly lower compute. 
For example, DUPS-Base attains **54.6 mIoU** on ADE20K in the ~110M-parameter class while using fewer FLOPs 
than comparable backbones.


## Getting Started

### Create environment and install requirements

```bash
sh create_env.sh
```

See further documentation inside the script file.

Our experiments are run with `CUDA==11.6` and `pytorch==1.12`.

### Prepare data

Please refer to [dataset README](datasets/README.md).

### Prepare pre-trained backbone checkpoint

Use `tools/convert-pretrained-model-to-d2.py` to convert any torch checkpoint `.pth` file trained on ImageNet into a Detectron2 model zoo format `.pkl` file.
```
python tools/convert-pretrained-model-to-d2.py dups_tiny.pth dups_tiny.pkl
```
Otherwise, d2 will assume the checkpoint is for the entire segmentation model and will not add `backbone.` to the parameter names, and thus the checkpoint will not be properly loaded.
This is not needed for later fine-tuned checkpoints.

### Train and evaluate

Modify the arguments in script `run_dups_segmentation.sh` and run
```bash
sh run_dups_segmentation.sh
```
for training or evaluation.

One can also directly modify the config files in `configs/`.

### Visualize predictions for pre-trained models

See script `run_demo.sh`. More details can be found in [Mask2Former GETTING_STARTED.md](https://github.com/facebookresearch/Mask2Former/blob/main/GETTING_STARTED.md).

### Analyze model FLOPs

See [tools README](tools/README.md).

