# MSExplainer: Multi-scale Explainer for Graph Neural Networks

## Abstract

Explainability for graph neural networks (GNNs) aims to unveil the complex decision logic of learned models by identifying the most influential structures in the input graph, thereby improving transparency and trustworthiness. Existing post-hoc explainers typically extract a sparse key subgraph at a single scale as the explanation. However, a single-scale view often fails to capture multi-level semantics, and the optimization procedure may degenerate into a local search that is sensitive to initialization and noise, leading to unstable explanations and limited robustness. To address these issues, we propose MSExplainer, a multi-scale explainer for GNNs. MSExplainer couples multi-scale subgraph consistency guidance with single-scale adaptive subgraph learning under a parameter-sharing design. It simultaneously extracts multi-scale key subgraphs and complementary subgraphs, yielding a hierarchical decomposition of the original graph that covers semantics at different granularities and improves the robustness of subgraph extraction. Experiments on six benchmark datasets show that MSExplainer consistently outperforms prior methods in explanation accuracy and fidelity. Moreover, we theoretically prove the upper bound advantage of the multi-scale strategy in representation consistency, and derive that it achieves the same-order computational complexity as single-scale methods under the parameter-sharing mechanism, thus ensuring the high fidelity of key subgraphs while maintaining computational efficiency.

## Environment Setup

### System Requirements
- Python 3.7 or higher
- Windows 10/11 (tested on Windows 10)
- At least 4GB RAM
- CUDA-compatible GPU (optional, for acceleration)

### Installation

#### Method 1: Using Batch Script (Recommended for Windows)
```bash
# Double-click to run
install_packages.bat
```

#### Method 2: Using Python Script
```bash
python install_packages.py
```

#### Method 3: Manual Installation

1. **Create Virtual Environment**
```bash
python -m venv msc_explainer_env
msc_explainer_env\Scripts\activate  # Windows
```

2. **Install PyTorch**
```bash
# CPU version
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu

# GPU version (if NVIDIA GPU available)
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
```

3. **Install PyTorch Geometric**
```bash
pip install torch-scatter torch-sparse torch-cluster torch-spline-conv torch-geometric -f https://data.pyg.org/whl/torch-2.0.0+cpu.html
```

4. **Install Other Dependencies**
```bash
pip install numpy scipy scikit-learn pandas matplotlib seaborn networkx opencv-python tensorboard tensorboardX tqdm jupyter ipywidgets
```

### Verify Installation
```python
python test_installation.py
```

## Usage

### Data Preprocessing

#### BBBP Dataset Oversampling
For the BBBP dataset, you can use the oversampling script to balance the dataset:

```bash
python bbbp_oversampling.py
```

This script will:
- Load the original BBBP dataset
- Apply oversampling techniques to balance the class distribution
- Save the balanced dataset for training
- Generate balanced checkpoints for the MSExplainer

**Note**: Run this script before training on BBBP dataset to ensure balanced class distribution.

### Run the main program
```bash
python MSExplainer_graph.py --dataset DATASET_NAME
```

- The `DATASET_NAME` options are automatically extracted from all checkpoint filenames in the `ckpt` folder (e.g., Mutagenicity, NCI1, BBBP, BA-2motif, etc.).
- For example:
```bash
python MSExplainer_graph.py --dataset NCI1
python MSExplainer_graph.py --dataset Mutagenicity
```