# The following code repository provides the source code for the paper "Which Algorithms Can Graph Neural Networks Learn?". 

## Code structure
This repository contains the following files:
graphgen.py: logic for graph generation and for training/test datasets
main.py: training loop and logging
model.py: code for the model and model layers
utils.py: utility methods

## Requirements
The code requires the following packages:
torch 
torch-geometric
networkx
tqdm
numpy
pandas
wandb
argparse
loguru

## Experiments

To replicate the experiments the following settings can be used. 

In main.py set the variables below as follows:
data = load_train_data() for the standard dataset
data = load_random_train_data() for randomly generated training graphs
data = load_random_train_data_paths() for randomly generated training graphs and the minimum required path graphs
test_data = load_test_data() for ER-constdeg and ER datasets
test_data = load_complete_test_data() for general dataset



Q1:
python main.py --num_steps 160000 
                    --seed 1
                    --learning_rate 0.0001
                    --input_dim 1 
                    --hidden_dim 64
                    --output_dim 16
                    --num_layers 2
                    --num_layers_mlp 2 
                    --log_every 100 
                    --save_every 40000 
                    --run_name test
                    --batch_size 1
                    --test_during_training 
                    --weight 50
                    --num_nodes_test 64 
                    --num_graphs 50 
                    --steps_test 2

Q2:
In graphgen.py set less_expressive_mpnn=True in the corresponding loader methods for each graph loader.

python main.py --num_steps 160000 
                    --seed 1
                    --learning_rate 0.0001
                    --input_dim 1 
                    --hidden_dim 64
                    --output_dim 16
                    --num_layers 2
                    --num_layers_mlp 2 
                    --log_every 100 
                    --save_every 40000 
                    --run_name test
                    --batch_size 1
                    --test_during_training 
                    --weight 50
                    --num_nodes_test 64 
                    --num_graphs 50 
                    --steps_test 2

Q3:
Set reg_term to either 1 or 2 depending on the regularization.
def l1_regularized_loss(out, batch, model=None, eta=0.1):
        pred, target = _select_scalar(out, batch)
        return torch.mean(torch.abs(pred - target)), eta* reg_term(model, 1) 


python main.py --num_steps 160000 
                    --seed 1
                    --learning_rate 0.0001
                    --input_dim 1 
                    --hidden_dim 64
                    --output_dim 16
                    --num_layers 2
                    --num_layers_mlp 2 
                    --log_every 100 
                    --save_every 40000 
                    --run_name test
                    --batch_size 1
                    --test_during_training 
                    --weight 50
                    --num_nodes_test 64 
                    --num_graphs 50 
                    --steps_test 2