# MPNN for general SDPs

## Environment setup

```angular2html
conda create -y -n gnnsdp python=3.11
conda activate gnnsdp

pip3 install torch==2.8.0 --index-url https://download.pytorch.org/whl/cu126
pip install torch_geometric
pip install torch_scatter torch_sparse -f https://data.pyg.org/whl/torch-2.8.0+cu126.html

pip install wandb hydra-core loguru
```

### (Optional) Solvers

For CVXPY with MOSEK solver, `pip install "cvxpy[MOSEK]"`. Note that you may need a license for the MOSEK solver.

For SCS solver with CPU usage only, `pip install scs`. For GPU acceleration, see the official documentation [SCS install](https://www.cvxgrp.org/scs/install/python.html#python-install), and a building example on [Colab](https://colab.research.google.com/drive/1POCgDNFg8fycHMI9T9N6V3iHFhXRthjn?usp=sharing). 

## Replicate results in Table 1
`python run.py --config-name ppgn train.datapath=DATAPATH` plus your extra args

Replace `ppgn` with `edge_gt/two_wl/ign/mpnn` for other baselines. 

### Maxcut
`python run.py --config-name ppgn train.runs=5 train.datapath=/PATH/TO/DATASET gnn.num_conv_layers=10 gnn.no_mp=true gnn.no_dual=true`

We can set no message passing mode, as the constraints are all equivalent and are on the diagonals, therefore all we need is an encoding for diagonal elements. However, if you wish to pretrain a model to warm start the SCS solver, you need predictions on the primal as well as slack and dual variables, therefore you should set `gnn.no_mp=false gnn.no_dual=false train.ckpt=true`.

If Edge GT runs OOM, use batch gradient accumulation by setting a smaller batch size and `train.accum > 1`. Besides, Edge GT is better with the message passing  
`python run.py --config-name edge_gt train.runs=5 train.datapath=/PATH/TO/DATASET gnn.num_conv_layers=10 gnn.no_mp=false gnn.no_dual=true`

### Lovasz
`python run.py --config-name ppgn train.runs=5 train.datapath=/PATH/TO/DATASET gnn.num_conv_layers=6 gnn.no_mp=false gnn.no_dual=true`

### MIS
`python run.py --config-name ppgn train.runs=5 train.datapath=/PATH/TO/DATASET gnn.num_conv_layers=6 gnn.no_mp=false gnn.no_dual=true`

### Cover
`python run.py --config-name ppgn train.runs=5 train.datapath=/PATH/TO/DATASET gnn.num_conv_layers=10 gnn.no_mp=false gnn.no_dual=true`

### SAT 
`python run.py --config-name ppgn train.runs=5 train.datapath=/PATH/TO/DATASET gnn.num_conv_layers=10 gnn.no_mp=true gnn.no_dual=true`

### LMI
`python run.py --config-name ppgn train.runs=5 train.datapath=/PATH/TO/DATASET gnn.num_conv_layers=6 gnn.no_mp=false gnn.no_dual=true`

## Replicate runtime

For CVXPY with MOSEK solver or SCS solver (built-in in CVXPY), run  
`python timing_solver.py --config-name solver solver=mosek train.datapath="/PATH/TO/DATASET"`

For timing the neural architectures, run  
`python timing.py --config-name ppgn train.datapath="/PATH/TO/DATASET"` plus some extra `gnn` parameters. The config `ppgn` can be replaced by `edge_gt`, `two_wl` etc. in `./config` folder. 

For warm starting the SCS solver, run  
`python warm_start_scs.py --config-name ppgn +train.modelpath="/PATH/TO/MODEL_FOLDER" train.datapath="/PATH/TO/DATASET" gnn.no_dual=false` plus some extra `gnn` parameters. Note that a pretrained model directory `"/PATH/TO/MODEL_FOLDER"` is necessary. 

For testing pretrained model on OOD data, run
`python test.py --config-name ppgn +train.modelpath="/PATH/TO/MODEL_FOLDER" train.datapath="/PATH/TO/DATASET"`

## Real-world dataset: SDPLIB
See [SDPLIB](https://github.com/vsdp/SDPLIB) for dataset statistics and downloading. Then you can process the SDPLIB data into our training format with SDPLIB.ipynb. 

For training, run  
`python run_sdplib.py --config-name ppgn train.datapath="/PATH/TO/SDPLIB" gnn.no_dual=false gnn.num_conv_layers=10 train.batchsize=1`

For evaluation, run  
`python warm_start_sdplib --config-name ppgn +train.modelpath="/PATH/TO/MODEL_FOLDER" train.datapath="/PATH/TO/SDPLIB" gnn.no_dual=false gnn.num_conv_layers=10`