# To be filled by the author(s) at the time of submission
# -------------------------------------------------------

title: "[Re] On Explainability of Graph Neural Networks via Subgraph Explorations"

authors:
  - name: Yannik Mahlau
    orcid: 0000-0003-0425-5003
    email: yannik.mahlau@stud.uni-hannover.de
    affiliations: 1,* 
    
  - name: Leonie Kayser
    orcid: 0000-0002-4749-4920
    email: kayser@stud.uni-hannover.de
    affiliations: 1,2
    
  - name: Lukas Berg
    orcid: 0000-0002-5684-6975
    email: lukas.berg@stud.uni-hannover.de
    affiliations: 1
    
affiliations:
  - code:    1
    name:    Leibniz Universität Hannover
    address: Hannover, Germany
  - code:    2
    name:    Max Planck Institute for Mathematics in the Sciences
    address: Leipzig, Germany


keywords: rescience c, rescience x, machine learning, explainable ai, graph neural networks, subgraphX, python

code:
  - url: https://github.com/ymahlau/subgraphx
  - doi: https://zenodo.org/badge/latestdoi/637344138
  - swh: swh:1:dir:439719e0ad99cbd3d980619c24dec1744b408dd0


data:
  - url:
  - doi:

replication:
 - cite: "H. Yuan, H. Yu, J.Wang, K. Li, and S. Ji. “On Explainability of Graph Neural Networks via Subgraph Explorations.” In: Proceedings of the 38th International Conference on Machine Learning. Vol. 139. Proceedings of Machine Learning Research. PMLR, July 2021, pp. 12241–12252."
 - bib: Yuan21
 - url: http://proceedings.mlr.press/v139/yuan21c/yuan21c.pdf
 - doi: 10.48550/ARXIV.2102.05152 identifier

abstract: "Yuan et al. claim their proposed method SubgraphX achieves (i) higher fidelity in explaining models for graph- and node classification tasks compared to other explanation techniques, namely GNNExplainer. Additionally, (ii) the computational effort of SubgraphX is at a 'reasonable level', which is not further specified by the original authors. We define this as at most ten times slower than GNNExplainer. \nWe reimplemented the proposed algorithm in PyTorch. Then, we replicated the experiments performed by the authors on a smaller scale due to resource constraints. Additionally, we checked the performance on a new dataset and investigated the influence of hyperparameters. Lastly, we improved SubgraphX using greedy initialization and utilizing fidelity as a score function. \nWe were able to reproduce the main claims on the MUTAG dataset, where SubgraphX has a better performance than GNNExplainer. Furthermore, SubgraphX has a reasonable runtime of about seven times longer than GNNExplainer. We successfully employed SubgraphX on the Karate Club dataset, where it outperforms GNNExplainer as well. The hyperparameter study revealed that the number of Monte-Carlo Tree search iterations and Monte-Carlo sampling steps are the most important hyperparameters and directly trade performance for runtime. Lastly, we show that our proposed improvements to SubgraphX significantly enhance fidelity and runtime. \nThe authors' description of the algorithm was clear and concise. The original implementation is available in the DIG-library as a reference to our implementation. \nThe authors performed extensive experiments, which we could not replicate in their full scale due to resource constraints. However, we were able to achieve similar results on a subset of the datasets used. Another issue was that despite the original code of the authors and datasets being publicly available, there were many compatibility issues. \nThe original authors briefly reviewed our work and agreed with the findings."

bibliography: bibliography.bib

type: Replication

domain: ML Reproducibility Challenge 2022

language: Python

  
# To be filled by the author(s) after acceptance
# -----------------------------------------------------------------------------

review: 
  - url: https://openreview.net/forum?id=zKBJw4Ht8s

contributors:
  - name:
    orcid: 
    role: editor
  - name:
    orcid:
    role: reviewer
  - name:
    orcid:
    role: reviewer

# This information will be provided by the editor
dates:
  - received:  November 1, 2018
  - accepted:
  - published: 

# This information will be provided by the editor
article:
  - number: 1 # Article number will be automatically assigned during publication
  - doi:    # DOI from Zenodo
  - url:    # Final PDF URL (Zenodo or rescience website?)

# This information will be provided by the editor
journal:
  - name: "ReScience C"
  - issn: 2430-3658
  - volume: 9
  - issue: 2
