# Installation
1) Experiments were ran using `R` version 4.3.2.
2) We provide a script `install_packages.R` to install all required packages
3) Use `devtools::install('shapr')` to install the adapted `shapr` package

# Generating synthetic data
1) Run `R/experiments/sample_dags.R`
2) Run `R/experiments/sample_scms.R`
3) Run `R/experiments/generate_data.R`

# Training prediction models
Run `R/experiments/train_models.R`

# Causal discovery
- We provide a command line interface to generate the Oracle MECs in `R/experiments/causal_discovery_oracle.R`; usage: `Rscript R/experiments/causal_discovery_oracle.R path_to_dag`
- We provide a command line interface to apply PC to generated data in `R/experiments/causal_discovery_pc.R`; usage: `Rscript R/experiments/causal_discovery_pc.R path_to_data n_cd alpha`
- We provide a command line interface to apply FGES to generated data in `R/experiments/causal_discovery_fges.R`; usage: `Rscript R/experiments/causal_discovery_fges.R path_to_data n_cd`

# Experiment suite
- For each type of Shapley values (marginal, conditional, causal using the true interventional distributions, causal estimated for a provided graph, MEC) we provide command line interfaces of the form `R/experiments/shapley_type.R`. Options for hyperparameters can be found using `Rscript R/experiments/shapley_type.R --help`. Example: `Rscript R/experiments/shapley_marginal.R --modelpath path_to_model --ntoexplain 40 --nobs 1000 --nmc 1000`
- In `R/experiments/jobs` we provide two functions `generate_local_script` and `generate_slurm_job` that can be used to create bash scripts that will run either locally or on a cluster using SLURM. For each experiment setting we have a folder in `R/experiments/jobs` that contains files to generate the necessary experiment scripts.

# shapr usage
We implemented our methods by adapting the `shapr` package (https://github.com/NorskRegnesentral/shapr). We implemented the following 'approaches':
- `approach="causal_true"` computes ground truth causal Shapley values and requires passing an scm as generated in step 2 above
- `approach="causal"` computes causal Shapley values for a specific graph by passing it as `causal_dag`
- `approach="mec"` requires passing a list of igraph graphs, e.g. the output of one of the causal discovery algorithms above, and  `causal_approximation_method = sample` or `causal_approximation_method = iw`.
For more details on usage we refer to the respective `shapley_type.R` files in `R/experiments`

