## Relational Graph Transformer (RELGT)

This repository implements the "Relational Graph Transformer" (RELGT), the first graph transformer architecture specifically designed for multi-table relational data represented as heterogeneous temporal graphs. The code is under active development.

# Overview

Relational Deep Learning (RDL) is a promising approach for building state-of-the-art predictive models on multi-table relational data. While Graph Neural Networks (GNNs) have been commonly used for this task, they struggle with capturing complex structural patterns and long-range dependencies. Our Relational Graph Transformer (RELGT) addresses the unique challenges of applying transformers to relational entity graphs through:

1. A novel multi-element tokenization strategy that decomposes each node into five components (features, type, hop distance, time, and local structure)
2. An architecture that combines local attention over sampled subgraphs with global attention to learnable centroids
3. Efficient encoding of heterogeneity, temporality, and topology without expensive precomputation

# Environment Setup

We provide environment setup instructions using Micromamba (a lightweight, faster alternative to Conda):

```
# Install Micromamba
mkdir ~/micromamba
wget -qO- https://micromamba.snakepit.net/api/micromamba/linux-64/latest | tar -xvj -C ~/micromamba
echo 'export PATH=~/micromamba/bin:$PATH' >> ~/.bashrc
source ~/.bashrc

# Create and activate environment
micromamba create -n gt python=3.12
micromamba activate gt

# Install PyTorch with CUDA
micromamba install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia

# Install uv for faster pip installations
curl -LsSf https://astral.sh/uv/install.sh | sh 

# Install PyTorch Geometric and its dependencies
uv pip install pyg-lib -f https://data.pyg.org/whl/torch-2.5.0+cu121.html
uv pip install torch-scatter torch-sparse torch-cluster torch-spline-conv -f https://data.pyg.org/whl/torch-2.5.0+cu121.html
micromamba install pyg -c pyg

# Install other dependencies
micromamba install -c conda-forge wandb absl-py tensorboard einops matplotlib progressbar pandas numba networkx scikit-network ipykernel tqdm
uv pip install kmeans-pytorch torchviz fastcluster opentsne ogb kmedoids relbench pytorch_frame[full] sentence-transformers h5py
```

# Running Experiments

To reproduce our experimental results on the RelBench, choose and bash run the script from "expts/" to run specific experiments:

1. run-hyperparam-sweep-small-experiments.sh -> Full hyperparameter sweep through the smaller tasks/datasets
2. run-large-base-experiments.sh -> Runs the baseline model code for the 9 larger tasks/datasets
3. run-global-ablation.sh -> Runs the global module ablation experiment (compares full relGT vs relGT without global)
4. run-k-ablation.sh -> Runs the k ablation experiment (K = 100, 300, 500)
5. run-lappe-ablation.sh -> Runs the laplation positional encoding ablation experiment (GNN pe_dim set to 4 and 0)
6. run-encoder-ablation.sh -> Runs the encoder ablation experiment

For example,

```
bash expts/run-hyperparam-sweep-small-experiments.sh
```
