Geometric Superpixel Representations for Efficient Image Classification with Graph Neural Networks

Published: 31 Jul 2023, Last Modified: 10 Aug 2023VIPriors 2023 OralPosterTBDEveryoneRevisionsBibTeX
Keywords: image classification, graph neural networks, superpixel segmentation, region adjacency graph
TL;DR: We propose ShapeGNN for efficient superpixel-based image classification with the novel addition of shape information.
Abstract: While Convolutional Neural Networks and Vision Transformers are the go-to solutions for image classification, their model sizes make them expensive to train and deploy. Alternatively, input complexity can be reduced following the intuition that adjacent similar pixels contain redundant information. This prior can be exploited by clustering such pixels into superpixels and connecting adjacent superpixels with edges, resulting in a sparse graph representation on which Graph Neural Networks (GNNs) can operate efficiently. Although previous work clearly highlights the computational efficiency of this approach, this prior can be overly restrictive and, as a result, performance is lacking compared to contemporary dense vision methods. In this work, we propose to extend this prior by incorporating shape information into the individual superpixel representations. This is achieved through a separate, patch-level GNN. Together with enriching the previously explored appearance and pose information of superpixels and further architectural changes, our best model, ShapeGNN, surpasses the previous state-of-the-art in superpixel-based image classification on CIFAR-10 by a significant margin. We also present an optimised pipeline for efficient image-to-graph transformation and show the viability of training end-to-end on high-resolution images on ImageNet-1k.
Supplementary Material: pdf
Submission Number: 3
Loading