# Hard-Constrained Graph Generation with Discrete-Projection Diffusion

## Overview

In this paper, we propose **NSPSG**, a discrete graph generation framework that enforces various constraints. By introducing an SMT (Satisfiability Modulo Theories)-based projector, NSPSG can ensure that the sampled graphs not only strictly satisfy specific constraints, but also remain within the distribution of the training data. Using supervised auto-regressive neural networks to simulate the solving process of SMT, NSPSG significantly enhances solving efficiency while ensuring accuracy.

![Overview of NSPSG](figs/method.png)

Across heterogeneous constraints (arithmetic, structural, and even combined) and various graph datasets (synthetic and real-world), NSPSG achieves 99%-100% precision, demonstrating state-of-the-art performance. Notably, it successfully handles a complex non-linear constraint, improving data validity by up to 44 percentage points and reaching 99% accuracy. 

## Environment installation

Our code was built on [ConStruct](https://github.com/manuelmlmadeira/ConStruct) and [LayerDAG](https://github.com/Graph-COM/LayerDAG), please refer to the sub folder or the targeted links to install the corresponding environment.

Specifically, this code was tested with PyTorch 2.2.0, cuda 12.1, torch_geometrics 2.3.1, Python 3.9 and Ubuntu 22.04. We provide the following environment installation commands and the environment can be installed easily.

```bash
conda create -c conda-forge -n NSPSG rdkit=2023.03.2 python=3.9
conda activate NSPSG
conda install -c conda-forge graph-tool=2.45
conda install -c "nvidia/label/cuda-12.1.0" cuda
pip install torch==2.2.0 --index-url https://download.pytorch.org/whl/cu121
conda install -c conda-forge cudatoolkit=12.1
pip install  dgl -f https://data.dgl.ai/wheels/torch-2.2/cu121/repo.html
pip install z3-solver
pip install -r requirements.txt
```


## Run the code

NSPSG is a plug-and-play framework, so it can be easily run our code based on these corresponding links aforementioned. We provide a fast version to run the experiment:

### Construct
- We use Construct to evaluate NSPSG for Structural Constraints and Combined Constraints. First enter the folder by running ```cd Construct``` 
- The constraints can be set through `configs/`. Notably, we suggest using 'marginal' in `configs/model/discrete.yaml` for SMT projection
- To train the diffusion model: ```python3 main.py``` and set train to True in `configs/model/discrete.yaml`
- To test the diffusion model: ```python3 main.py +experiment=<dataset_name> general.test_only=<path>``` and set train to False in `configs/model/discrete.yaml`
- Below are two visualizations depicting valid TLS samples: 

![TLS Samples](figs/tls.png)

### LayerDAG
- We use LayerDAG to evaluate NSPSG for Non-Linear Arithmetic Constraints. First enter the folder by running ```cd LayerDAG``` 
- First generate the training data using z3 solver (we separate this part from the sampling process for convenience): ```python generate_smt_data.py --rho rho --iter iter```, where rho is in {0.0, 0.5, 1.0} and iter can be used to control the amount of training data. We suggest set iter to 10.
- To train the auto-regressive neural corrector: ```python train_smt_ar.py --rho rho```
- To train the diffusion model: ```python train.py --rho rho```
- To test the diffusion model without projection: ```python sample.py --model_path <path> --rho rho```
- To test the diffusion model with SMT projection: ```python sample.py --model_path <path> --check True --solve True --rho rho```
- To test the diffusion model with neural projection: ```python sample.py --model_path <path> --check True --check_with_refine True --rho rho```