Characterize and Transfer Attention in Graph Neural Networks

Mufei Li; Hao Zhang; Xingjian Shi; Minjie Wang; Yixing Guan; Zheng Zhang

Characterize and Transfer Attention in Graph Neural Networks

Mufei Li, Hao Zhang, Xingjian Shi, Minjie Wang, Yixing Guan, Zheng Zhang

25 Sept 2019 (modified: 05 May 2023)ICLR 2020 Conference Blind SubmissionReaders: Everyone

TL;DR: An analytic paradigm for studying attention in graph neural networks and an approach to perform transfer learning for graph sparsification

Abstract: Does attention matter and, if so, when and how? Our study on both inductive and transductive learning suggests that datasets have a strong influence on the effects of attention in graph neural networks. Independent of learning setting, task and attention variant, attention mostly degenerate to simple averaging for all three citation networks, whereas they behave strikingly different in the protein-protein interaction networks and molecular graphs: nodes attend to different neighbors per head and get more focused in deeper layers. Consequently, attention distributions become telltale features of the datasets themselves. We further explore the possibility of transferring attention for graph sparsification and show that, when applicable, attention-based sparsification retains enough information to obtain good performance while reducing computational and storage costs. Finally, we point out several possible directions for further study and transfer of attention.

Keywords: Graph Neural Networks, Graph Attention Networks, Attention, Transfer Learning, Empirical Study

Original Pdf: pdf

11 Replies

Loading