#  Survival Models Evaluation Framework

This project provides a unified experimental framework to train and evaluate multiple **survival analysis models** under varying censorship levels and replications — all locally, without the need for a cluster or PBS system.

---

##  Included Models

All models are implemented in a single file [`all_models.py`](all_models.py):

| Model | Library | Function |
|--------|----------|-----------|
| **Cox Proportional Hazards** | `lifelines` | `nested_cv_coxph()` |
| **Weibull AFT** | `lifelines` | `nested_cv_weibull_aft()` |
| **Random Survival Forest (RSF)** | `scikit-survival` | `nested_cv_rsf()` |
| **Neural Multi-Task Logistic Regression (NMTLR)** | `pycox` | `nested_cv_mtlr()` |
| **DeepHit** | `pycox` | `nested_cv_deephit()` |

All models return **two DataFrames**:
- `metrics_standard`: evaluation using censored times.
- `metrics_true`: evaluation using true (uncensored) times.

Each DataFrame contains:
`fold, c_index, ibs, d_calib, mae, one_calib@t, c_index@t, ...`

---

##  Running Experiments

###  1. Run a single experiment (local)

To run one model on one dataset with a given censorship level:

```bash
python run_experiment.py \
  --dataset metabric \
  --model rsf \
  --censorship 30 \
  --n_trials 10 \
  --n_outer 5 \
  --n_inner 3 \
  --replications 5
```

Expected console output:
- Loading dataset from: ./data/metabric/metabric_30_repl_1.csv
- Loaded 1900 samples | 15 features | censorship=30% | replication=1
- Finished replication 1 | std shape=(5, 14), true shape=(5, 14)

 
Results will be saved automatically in:

```bash
./results/res_metabric/metabric_rsf.csv
```

All datasets should follow this path format:

```bash
./data/{dataset_name}/{dataset_name}_{censorship_level}_repl_{replication}.csv
```


Each dataset must contain the following columns:
- time	float	Observed time or censoring time
- event	int (0/1)	Event indicator (1 = event, 0 = censored)
- true_time	float	True event time (for evaluation only)
- Other columns	numeric	Covariates/features used for modeling



You can run all datasets and censorship levels for a specific model using the provided Bash script:

```bash
bash submit_all_jobs.sh rsf
```

This script loops through:

```bash
datasets=("nacd" "flchain" "gbmlgg" "metabric" "pbc" "support")
censor_rates=(10 20 30 50 70 90)
```

and executes for each configuration:

```bash
python run_experiment.py \
  --dataset $dataset \
  --model $model \
  --censorship $censorship \
  --n_trials 10 \
  --n_outer 5 \
  --n_inner 3 \
  --replications 5
```

You can modify submit_all_jobs.sh to adjust:
- the list of datasets
- the censorship rates
- the number of trials, folds, or replications


All metrics are computed twice:
- standard (using censored times)
- true_time (using actual event times)


Saved in:

```bash
./results/res_{dataset_name}/{dataset_name}_{model}.csv
```

Each line corresponds to one fold of one replication.

### Example:

```bash
fold | c_index | ibs | d_calib | mae | one_calib@25 | ... | evaluation_type | censorship_rate | replication
```

Predictions saved in:

```bash
./predictions/pred_{dataset_name}_{censorship}/
```

### Notes
Normalization: Only numeric columns outside [0,1] are scaled using MinMax or StandardScaler.

True times are used only for evaluation, never during training.

All results are reproducible thanks to fixed random seeds and consistent data paths.
