# PPI-3
This repo contains the implementation of the ICLR 2026 submitted paper, [Boosting Domain Generalization in Object Detection through the Lens of Phase Invariance]().

![Screenshot](figs/8.png)

Temporal and seasonal variations in dynamic real-world environments result in diverse visual appearances, posing significant challenges for object detection models to maintain consistently high performance. Although existing Domain Generalization (DG) methods have shown promise in enhancing model robustness, they often neglect the spatial structural relationships of objects during the learning of domain-invariant features, thereby limiting their effectiveness in object detection tasks compared to classification tasks. From the perspective of Preserving Phase Invariance (PPI), we propose a novel methodology that aims to enhance model generalization while preserving accurate object localization. This methodology comprises three complementary modules: Mix Normalization Perturbation (MNP), which synthesizes diverse styles to improve robustness; Sensitive Channel Perturbation (SCP), which suppresses domain-specific features at the channel level; and Amplitude-aware Attention (AOA), which applies spectral attention to the amplitude component. Together, these modules promote phase-invariant representations and contribute to improved cross-domain detection performance. Our approach fundamentally reduces the domain generalization gap in classification and detection by maintaining the integrity of key structural information. Our proposed methods achieve state-of-the-art performance on Unsupervised Domain Adaptation and Single Domain Generalization Object Detection benchmarks, even outperforming most recent state-of-the-art Domain Adaptation techniques.

 ## Installation

### Environment
- CUDA 10.2
- Python >= 3.7
- Pytorch >= 1.6
- THe Detectron2 version matches Pytorch and CUDA versions.

### Dependencies

1. Create a virtual env.
- `python3 -m pip install --user virtualenv`
- `python3 -m venv PPI-3`
- `source PPI-3/bin/activate`

- Install Pytorch
`pip3 install torch torchvision`

Check out the previous Pytorch versions [here](https://pytorch.org/get-started/previous-versions/).

- Install Detectron2
Build Detectron2 from Source (gcc & g++ >= 5.4)
`python -m pip install 'git+https://github.com/facebookresearch/detectron2.git'`

Or, you can install Pre-built detectron2 (example for CUDA 10.2, Pytorch 1.9)

`python -m pip install detectron2 -f \
  https://dl.fbaipublicfiles.com/detectron2/wheels/cu102/torch1.9/index.html`

More details can be found [here](https://detectron2.readthedocs.io/en/latest/tutorials/install.html).

## Data Preparation
1.Download the Cityscapes and FoggyCityscpaes dataset [here](https://www.cityscapes-dataset.com/).

2.Download the Diversity Weather dataset [here](https://drive.google.com/drive/folders/1IIUnUrJrvFgPzU8D6KtV0CXa8k1eBV9B).

After downloading, please convert them into coco format and place them under the datasets folder. And configure the data set storage path in src/data/builtin.py.

## Get Started

For joint training,

```python
python3 -m tools.train_net --config-file [config_file] --num-gpus 8
```

For evaluation,

```python
python3 -m tools.train_net --config-file [config_file] --num-gpus [num] --eval-only
```
This command will load the latest checkpoint in the folder. If you want to specify a different checkpoint or evaluate the pretrained checkpoints, you can run 
```python
python3 -m tools.train_net --config-file [config_file] --num-gpus [num] --eval-only MODEL.WEIGHTS [PATH_TO_CHECKPOINT]
```

## Model Weights
Once our paper is accepted, we will provide our model weights here:

PPI-3 for C2F: [Google Drive]().

PPI-3 for Diversity Weather Dataset: [Google Drive]().

## Analysis via Tide Toolbox
|![Screenshot](figs/6.png) | ![Screenshot](figs/7.png)|
|------------------------|------------------------|

We use Tide toolbox to analyze the difference in detecting major errors between methods with and without PPI constraints. We compare our SCP module in PPI-3 with the DomainDrop method on the "night rainy" domain in the Diversity Weather Dataset. For fair comparison, our two methods are MNP+SCP and MNP+DomainDrop. The training parameters are consistent with the parameters of the model trained on the Diversity Weather Dataset benchmark in the paper. The mAP50 scores are 24.3 for PPI versus 22.0 for DomainDrop. Regarding the main errors, the classification dAP is 6.4 (PPI) versus 6.0 (DomainDrop), and the localization dAP is 5.2 (PPI) versus 6.0 (DomainDrop). This indicates that our method PPI, while slightly increasing the classification error, has improved over all detection performance by locating more objects.

## Benchmark Results
![Screenshot](figs/1.png)

![Screenshot](figs/2.png)

![Screenshot](figs/10.png)

![Screenshot](figs/11.png)

## Visualization
![Screenshot](figs/3.png)

![Screenshot](figs/5.png)
