# Unveiling the role of uncertainty in tabular deep learning (reproduction code)

Here is a short intro into reproducing our results. 

All empirical results are obtained by launching relevant scripts in the `bin` directory with corresponding configuration files from the `exp` directory.

All plots, analysis and synthetic data generation code is in the jupyter notebooks.

There is an `env.yaml` with the conda environment we've used for all experiments. Use it + conda/mamaba/micromamba to create the exact environment that was used for the experimetns with `micromamba create -f env.yaml`.

### Embedding Scheme

To reproduce the results on the embedding pretraining see the `bin/lrlr_triplet.py`

### Results table 

Results for the Table 1 are aggregated via the `./results.ipynb` notebook.

### Notebooks

The `./notebooks` directory contains all synthetic data generation and uncertainty plotting notebooks. 

In order for notebooks to work, you'd have to run the relevant models and the `exp/catboost_unc_full` model to produce predictions which are used for plotting.

### CIFAR10 Experiments

To reproduce the CIFAR10 results you should first run `notebooks/*_Cifar.ipynb` to generate respective dataset version. Then run `exp/{model}/CIFAR10-{certain|uncertain}` as usual.
