Gradient-Based Clean Label Backdoor Attack to Graph Neural Networks

Published: 01 Jan 2024, Last Modified: 22 Oct 2024 · ICISSP 2024 · CC BY-SA 4.0
Abstract: Graph neural networks (GNNs) can extract useful information from graph-structured data. Although their capabilities are promising, GNNs are vulnerable to backdoor attacks, which plant a marker called a trigger in victims' models to cause them to misclassify poisoned data carrying the trigger into a target class. In particular, clean label backdoor attacks (CLBAs) on GNNs remain largely unexplored. Revealing the characteristics of CLBAs is vital from the perspective of defense. In this paper, we propose the first gradient-based CLBA on GNNs for graph classification tasks. Our attack consists of two phases: graph-embedding-based pairing and gradient-based trigger injection. The pairing step matches graphs of the target class with graphs of the other classes so that the backdoor is planted in the target-class region of the graph embedding space. The trigger injection step embeds triggers into graphs using gradient-based scores, yielding effective poisoned graphs. We conduct experiments on multi
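The abstract describes scoring candidate trigger edges by gradients of a target-class loss. The sketch below is a hypothetical, simplified illustration of that idea (not the paper's actual method): it uses a toy linear surrogate classifier whose logit is a weighted sum of adjacency entries, so the gradient of the target-class loss with respect to each entry is available in closed form, and it flips the top-scoring absent edges to form the trigger. The function name, the linear surrogate, and the weight matrix `w` are all assumptions for illustration.

```python
import numpy as np

def gradient_trigger_injection(A, w, k):
    """Hypothetical sketch of gradient-based trigger injection.

    A: symmetric 0/1 adjacency matrix of one graph.
    w: weights of a toy linear surrogate classifier whose target-class
       logit is sum(w * A); loss = -logit, so dLoss/dA[i,j] = -w[i,j].
    k: number of trigger edges to add.
    """
    n = A.shape[0]
    grad = -w               # gradient of the target-class loss w.r.t. A
    scores = -grad          # benefit of setting A[i,j] = 1 (loss decrease)
    # Candidate edges: currently absent, off-diagonal (upper triangle).
    cand = [(scores[i, j], i, j)
            for i in range(n) for j in range(i + 1, n) if A[i, j] == 0]
    cand.sort(reverse=True)
    # Flip the k highest-scoring non-edges, keeping A symmetric.
    A_poison = A.copy()
    for _, i, j in cand[:k]:
        A_poison[i, j] = A_poison[j, i] = 1
    return A_poison
```

With a real GNN one would instead obtain the scores via automatic differentiation of the model's target-class loss with respect to the adjacency matrix, but the selection rule, rank entries by gradient magnitude and flip the best ones, is the same.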