TabGraphs: A New Benchmark and Strong Baselines for Learning on Graphs with Tabular Node Features

Published: 16 Nov 2024, Last Modified: 26 Nov 2024LoG 2024 PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: graph machine learning, tabular machine learning, graph neural network, gradient boosting, benchmark
TL;DR: We address the gap between tabular and graph machine learning by collecting a new benchmark of meaningful tabular datasets with known graph structure and introducing strong and simple baselines previously overlooked by research community.
Abstract: In the field of tabular ML, table rows are typically treated as independent data samples, but sometimes additional information about relations between them is available, and such information can be naturally modeled with a graph. Thus, tabular ML may benefit from graph ML methods. However, graph ML models are typically evaluated on datasets with homogeneous node features, which are very different from heterogeneous mixtures of numerical and categorical features present in tabular datasets. Hence, there is a critical difference between the data used in tabular and graph ML studies, which does not allow one to understand how successfully graph models can be transferred to tabular data. To bridge this gap, we propose a new benchmark of diverse graphs with heterogeneous tabular node features and realistic prediction tasks. We use this benchmark to evaluate a vast range of models, including simple methods previously overlooked in the literature. Our experiments show that graph neural networks indeed can often bring gains in predictive performance for tabular data, but standard tabular models can also be adapted to work with graph data by using simple graph-based feature augmentation, which sometimes enables them to compete with and even outperform graph neural models. Based on our empirical study, we provide insights for researchers and practitioners in both tabular and graph ML fields.
Submission Type: Extended abstract (max 4 main pages).
Poster: jpg
Poster Preview: jpg
Submission Number: 84
Loading