# Domain-Aware Gradient Reuse for Anomaly Detection

This repository contains the official implementation of the paper: **Domain-Aware Gradient Reuse for Anomaly Detection (DAGR)**.

DAGR is a transfer-learning framework designed for anomaly detection, particularly effective when the target domain has few or no labeled anomalies. It works by selectively reusing gradient information from a source domain rich in labeled outliers to guide the model training in the target domain.

## Highlights

-   **Top-Tier Performance**: DAGR achieves a mean AUROC of 81.81%, outperforming the top unsupervised and domain-generalization methods by approximately 6 percentage points.
-   **Novel Gradient-Centric Adaptation**: Instead of aligning features, DAGR introduces a new perspective by operating directly on the gradients. This approach is founded on the observation that gradient distributions show remarkable consistency across related domains, making them a more effective conduit for knowledge transfer.
-   **No Target Anomalies Required**: The framework is designed to transfer anomaly knowledge from a source domain without needing any anomalous samples from the target domain during training. 

## Framework

The DAGR framework operates on two architecturally identical networks: a source network and a target network. The core idea is to estimate the missing anomaly gradients in the target domain by transforming and reusing gradients from the source domain. This is accomplished in two main steps:

**Step 1: Consistent Channel Selection (CCCS)** 

The DAGR framework identifies sub-modules that behave consistently across both source and target domains by computing the cosine similarity between the normal-sample gradients for each corresponding channel. Channels exhibiting low similarity are considered domain-specific and are subsequently masked by nullifying their gradients. This process prevents them from biasing the subsequent knowledge transfer.

**Step 2: Adaptive Domain-Specific Perturbation Removal (ADPR)** 

After the initial filtering, the framework learns an adaptive, non-linear mapping to transform the source's normal gradients. The same learned transformation is then applied to the masked source anomaly gradients to generate surrogate target anomaly gradients. These estimated anomaly gradients are combined with the target's normal gradients to update the target model, allowing it to learn discriminative features without being exposed to any labeled anomalies.

## Environment Setup

This guide requires Anaconda or Miniconda for managing dependencies. Please note that a CUDA-enabled GPU is necessary.

```bash
pip install -r requirements.txt
```


## Dataset Preparation

This project evaluates **GR-DPR** on two benchmarks:

- **DCASE 2020 Task 2** \[[1](#references)\]  
  - **Source domain:** `Fan`  
  - **Target domains:** `Pump`, `Slider`, `Valve`, `ToyCar`, `ToyConv`
- **DAGM** \[[2](#references)\]  
  - **Source domain:** `Class 2`  
  - **Target domains:** `Class 1`, `Class 3`, `Class 6`

### Directory layout

```text
datasets/
├── DCASE/
│   ├── Fan/           # source
│   │   ├── train/     # 100 % normal + 100 % anomalous
│   │   └── test/      # full test set
│   ├── Pump/          # target
│   │   ├── train/     # 10 % normal
│   │   └── test/      # 90 % normal + 100 % anomalous
│   ├── Slider/
│   ├── Valve/
│   ├── ToyCar/
│   └── ToyConv/
│       └── …          # same split rule as Pump
└── DAGM/
    ├── Class 2/        # source
    │   ├── train/     # 100 % normal + 100 % anomalous
    │   └── test/
    ├── Class 1/         # target
    ├── Class 3/
    └── Class 6/
        └── …          # same split rule as Cable

Sampling rule
• For source domains (Fan, Bottle) all normal and anomalous samples are placed in train/.
• For target domains only 10 % of the normal samples are copied into train/; every remaining sample (90 % normal + all anomalies) is moved to test/.
This emphasizes DAGR’s cross-domain generalisation under limited target supervision.

```

---

## Quick Start

Follow these two simple steps to launch **GR-DPR** training on any source–target pair.

### 1&nbsp;&nbsp;Edit `config.py`

Specify the data folders for the current experiment:

```python
config = {
    # Path to *source-domain* training set (all normal + all anomalies)
    "train_dir": "datasets/DCASE/Fan/train",

    # Path to *target-domain* test set (90 % normal + all anomalies)
    "test_dir":  "datasets/DCASE/Pump/test",

    # Path to *target-domain* training subset (10 % normal)
    "align_dir": "datasets/DCASE/Pump/train",
}
```

Tip:
Replace the three paths above with the directories of any other source / target combination, for example
datasets/MVTec/Bottle/train, datasets/MVTec/Cable/test, and datasets/MVTec/Cable/train.


### 2&nbsp;&nbsp;Run Training

```bash

python train.py

```
The script will read config.py, load the specified datasets, and begin training automatically. 






<a id="references"></a>

### References

1. Koizumi Y. *et al.* **Description and Discussion on DCASE 2020 Challenge Task 2**. *DCASE 2020 Workshop*, 2020.  
2. Matthias W. *et al.* **Weakly supervised learning for industrial optical inspection**. *PR*, 2007.


## Citation

If you find our repository useful for your research, please consider citing our paper:
