# GSNF

## Requirements

GSNF has tested using Python 3.10 and Conda environment management tool.

To have consistent libraries and their versions, you can install needed dependencies 
for this project running the following command:

```
conda env create -f environment.yml
```

## Datasets

We used 5 public datasets in the experiments. MIMIC-IV and eICU need users to register as a credentialed user and sign a data use agreement on the PhysioNet Platform (https://physionet.org/), to obtain the access. PhysioNet 2012,P12 and P19 can be freely obtained without registration.


### MIMIC-IV

- **Obtain**: We used MIMIC-IV v1.0, which can be downloaded from https://physionet.org/content/mimiciv/1.0/. 

- **Preprocess**: The preprocessing programs can be found in folder "*preprocess/pre_mimic4*". They were adapted from the repository of [Neural Flow](https://github.com/mbilos/neural-flows-experiments/tree/master/nfe/experiments/gru_ode_bayes/data_preproc).

### eICU

- **Obtain**: We used eICU v2.0, which can be downloaded from https://physionet.org/content/eicu-crd/2.0/.

- **Preprocess**: The preprocessing programs can be found in folder "*preprocess/pre_eicu*". They were developed upon a [previous work](https://github.com/mostafaalishahi/eICU_Benchmark).

### PhysioNet 2012

- **Obtain**: We used PhysioNet 2012 v1.0, which can be downloaded from https://physionet.org/content/challenge-2012/1.0.0/. 

- **Preprocess**: The automatic downloading and preprocessing code is in file "*experiments/data_physionet12.py*". This file was built upon a [previous program](https://github.com/mbilos/neural-flows-experiments/blob/master/nfe/experiments/latent_ode/physionet.py).

### P12 P19

For dataset P19 and P12. We use the data processed by Raindrop.

The raw data can be found at:

(1) P19: https://physionet.org/content/challenge-2019/1.0.0/

(2) P12: https://physionet.org/content/challenge-2012/1.0.0/

The datasets processed by Raindrop can be obtained at:

https://github.com/mims-harvard/Raindrop

## Experiments

## Running the code


Run GSNF on PhysioNet 2012 dataset for time series classification:

```
python main.py --random-state 0 --ml-task biclass --ivp-solver gnn --data physionet12 --time-max 2880 --ratio-ce 1000 --latent-dim 37 --hidden-dim 128 --flow-layers 2 --batch-size 50
```

Run GSNF on Synthetic dataset for time series reconstruction:

```
python main.py --random-state 1 --ml-task synthetic --ivp-solver gnn --data synthetic --time-max 2880
```