
# Broaden the Spectrum

A first-principles approach to using more of the graph spectrum in graph transformers.

### Setup
We suggest using this codebase with Python 3.9. 

This project's environment is managed using `venv`. To create an environment, run:
 ```bash 
 source venv_setup.sh
 source venv/bin/activate
 export PYTHONPATH=$PYTHONPATH:$(pwd)
```

### Preprocessing data

 ```bash
python preprocess.py dataset=<dataset_shortname> 
```

A list of all supported datasets can be found in `configs/dataset`
 
### Training

 ```bash
python train.py dataset=<dataset_shortname> model=<model> eigen_subset=<eigen_subset>
```
Possible values for `<model>`:
- `gt`
- `gt-bts`
- `nagphormer`
- `nagphormer-bts`
- `gps`
- `gps-bts`

Possible values for `<eigen_subset>`:
- `truncate`
- `bts`

Other configurations can be found in `configs/train.yaml`

### Hyperparameter Tuning
We use Ray to perform hyperparameter tuning.
First, start a ray server using (only need to do once):
```bash
ray start --head
```

then run the following:

 ```bash
python <ray_script> \
  out_dir=../BTS/ray_experiments \
  dataset=<dataset_shortname> \
  model=<model> \
  eigen_subset=<eigen_subset> \
  ray.name=experiment1 
```

Choose `<ray_script>` based on the following:
- GT-BTS: `ray_scripts/bts.py`
- Baselines: `ray_scripts/baseline.py`


Other configuration options can be found in `configs/ray.yaml`.


### Preprocessing large graphs
#### Installing SlepC4py
Requires some system tools: `MPI`, `FORTAN`, `BLAS`, and `LAPACK`. On Ubuntu, do the following:
```bash
sudo apt-get install \
  openmpi-bin \
  openmpi-doc \
  libopenmpi-dev \
  gfortran \
  libblas-dev \
  liblapack-dev
```

Then, you can do:
```bash
uv pip install mpi4py petsc
pip install petsc4py  # does not work with uv
pip install slepc slepc4py  # does not work with uv
```