# GNN Explanations that do not Explain and How to find Them

ICLR26 anonymous supplemental submission.


## Setup

We report the original `setup.py` file as provided by the original codebase on which this is based on. More details regarding the installation can be found in https://github.com/divelab/LECI.

## Datasets

Dataset implementations are provided in `GOOD\data\good_datasets`.

MNIST75sp is generated according to the [original paper](https://github.com/bknyaz/graph_attention_pool/tree/master/scripts).

Raw dataset files are not included for size constraint. When downloading raw datasets, they should be placed in `storage/datasets`.

## Checkpoints

Checkpoints will be later made available as a separata DRIVE folder, as they exceed the size limit. Checkpoints are saved in `storage/checkpoints`.

## Model Implementations

Model implementations can be found here `GOOD\networks\models`.

## Faithfulness Metrics Implementation

The implementation of faithfulness metrics can be found in `GOOD\kernel\pipelines\xai_metric_utils.py`.

## Faithfulness Metrics Implementation

Attack details are available in `GOOD\kernel\pipelines\basic_pipeline.py` in the function `def pretrain_model(...)`.

## Training Details

The file `GOOD\ood_algorithms\algorithms\BaseOOD.py` provides the training logic for each model. Architecture-specific training protocols are available in the corresponding file in the same folder.

## Configurations files

Configuration files and hyper-parameter details for each experiment are available in `configs/final_configs`.
Further run-specific hyperparameter are available in the list of commands below.

## Reproducing the Experiments

Below we report the commands used to reproduce our experiments. The `--task` parameter regulates the behaviour of the code, and can be set as follows for practical utility:

- `test`: Evaluate the trained model
- `train`: Train the model
- `eval_metric`: Evaluate faithfulness metrics (see below for details)
- `plot_panel`: Plot the histogram distribution of explanation relevance scores
- `plot_explanations`: Plot examples of explanations


```shell

##
# RBGV (named BAColorGVIsol in the codebase)
##

# GSAT natural training
goodtg --config_path final_configs/BAColorGVIsol/basis/no_shift/GSAT.yaml --seeds "1/2/3/4/5" --task test --backbone ACR2 --extra_param False 10 0.3
# GSAT attacked
goodtg --config_path final_configs/BAColorGVIsol/basis/no_shift/GSAT.yaml --seeds "1/2/3/4/5" --task test --pretrain degenerate --backbone ACR2
# GSAT attacked but to output sufficient explanations (Sanity check in the Appendix)
goodtg --config_path final_configs/BAColorGVIsol/basis/no_shift/GSAT.yaml --seeds "1/2/3/4/5" --task test --pretrain suff --backbone ACR2

# SMGNN natural training
goodtg --config_path final_configs/BAColorGVIsol/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --backbone ACR2
# SMGNN attacked
goodtg --config_path final_configs/BAColorGVIsol/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --pretrain degenerate --backbone ACR2
# SMGNN attacked but to output sufficient explanations (Sanity check in the Appendix)
goodtg --config_path final_configs/BAColorGVIsol/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --pretrain suff --backbone ACR2
# SMGNN natural degeneracy (sec. 6)
goodtg --config_path final_configs/BAColorGVIsol/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --entr_coeff 1.0 --l_norm_coeff 0.4 --backbone ACR2 --use_readout_norm bn

# DIR natural training
goodtg --config_path final_configs/BAColorGVIsol/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --backbone ACR2 --ood_param 0.5
# DIR attacked
goodtg --config_path final_configs/BAColorGVIsol/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --pretrain degenerate --backbone ACR2 --ood_param 0.5
# DIR attacked but to output sufficient explanations (Sanity check in the Appendix)
goodtg --config_path final_configs/BAColorGVIsol/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --pretrain suff --backbone ACR2  --ood_param 0.5
# SMGNN low K (sec. 6)
goodtg --config_path final_configs/BAColorGVIsol/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --backbone ACR2  --ood_param 0.01




##
# MNIST
##

# GSAT natural training
goodtg --config_path final_configs/MNIST/basis/no_shift/GSAT.yaml --seeds "1/2/3/4/5" --task test --backbone ACR2
# GSAT attacked
goodtg --config_path final_configs/MNIST/basis/no_shift/GSAT.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain degenerate --backbone ACR2
# GSAT attacked but to output sufficient explanations (Sanity check in the Appendix)
goodtg --config_path final_configs/MNIST/basis/no_shift/GSAT.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain suff  --backbone ACR2

# SMGNN natural training
goodtg --config_path final_configs/MNIST/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --backbone ACR2
# SMGNN attacked
goodtg --config_path final_configs/MNIST/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain degenerate --backbone ACR2
# SMGNN attacked but to output sufficient explanations (Sanity check in the Appendix)
goodtg --config_path final_configs/MNIST/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain suff  --backbone ACR2
# SMGNN natural degeneracy
goodtg --config_path final_configs/MNIST/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --ood_param 1 --backbone ACR2

# DIR natural training
goodtg --config_path final_configs/MNIST/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --backbone ACR2 --ood_param 0.8
# DIR attacked
goodtg --config_path final_configs/MNIST/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain degenerate --backbone ACR2 --ood_param 0.8 --use_readout_norm bn
# DIR attacked but to output sufficient explanations (Sanity check in the Appendix)
goodtg --config_path final_configs/MNIST/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain suff --backbone ACR2 --ood_param 0.8 --use_readout_norm none
# DIR low K
goodtg --config_path final_configs/MNIST/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 0 --backbone ACR2 --ood_param 0.1 --use_readout_norm bn


##
# Mutagenicity (--backbone ACR2 by default)
##

# SMGNN normal training
goodtg --config_path final_configs/MUTAG/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1
# SMGNN attacked
goodtg --config_path final_configs/MUTAG/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain degenerate

# GSAT normal training
goodtg --config_path final_configs/MUTAG/basis/no_shift/GSAT.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1
# GSAT attacked
goodtg --config_path final_configs/MUTAG/basis/no_shift/GSAT.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain degenerate

# DIR normal training
goodtg --config_path final_configs/MUTAG/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --ood_param 0.5
# DIR attacked
goodtg --config_path final_configs/MUTAG/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --ood_param 0.5 --pretrain degenerate



##
# GraphSST2Planted (--backbone ACR2 by default)
##

# GSAT normal training
goodtg --config_path final_configs/SST2Planted/basis/no_shift/GSAT.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1
# GSAT attacked
goodtg --config_path final_configs/SST2Planted/basis/no_shift/GSAT.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain degenerate

# SMGNN normal training
goodtg --config_path final_configs/SST2Planted/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1
# SMGNN attacked
goodtg --config_path final_configs/SST2Planted/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain degenerate

# DIR normal training
goodtg --config_path final_configs/SST2Planted/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1
# DIR attacked
goodtg --config_path final_configs/SST2Planted/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain degenerate
# DIR low K 
goodtg --config_path final_configs/SST2Planted/basis/no_shift/DIR.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --ood_param 0.1



##
# CPatchMNIST ("Plausible but Unfaithful" experiment in appendix)
##

# SMGNN
goodtg --config_path final_configs/CPatchMNIST/basis/no_shift/SMGNN.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain suff --backbone ACR2
# GSAT
goodtg --config_path final_configs/CPatchMNIST/basis/no_shift/GSAT.yaml --seeds "1/2/3/4/5" --task test --gpu_idx 1 --pretrain suff --backbone ACR2
```



## Computing faithfulness metrics 




To compute faithfulness metrics, switch the `--task` to `eval_metric`, and use the parameter `--metrics` to specify a list of `/`-separated metrics names. The parameter `--metrics` can be set to the following values:

- fidm
- fidp
- rfidm
- rfidp
- suff
- nec
- counter_fid (aka CF in our paper)
- suff_cause (aka EST in our paper)



Examples are reported refer to `scripts\faithfulness_deg.sh` and `scripts\faithfulness_suff.sh`, for the attack presented in Sec. 4 and the sanity check in the Appendix, respectively.

It is possible to regulate the budget of perturbations to use via the parameter `--expval_budget`. An example is shown in `scripts\faithfulness_ablation_expval_budget.sh`.