This document provides instructions for running the code used in the paper 

**"Courtroom Analogy: New Perspective on Uncertainty-Aware Classification"**, submitted to ICML 2026. 

The provided code enables reviewers to reproduce the experimental results reported in the paper and to explore various configurations of our proposed model, MoDEX. 



##### \[Overview] 

The main entry point for running MoDEX experiments is main.py. The script sequentially performs the following steps:

* 1\. Training
* 2\. Testing
* 3\. Misclassification detection
* 4\. OOD detection
* 5\. Distribution shift detection (CIFAR-10 only)



##### \[Code Structure]

* **main.py:** Main script for running MoDEX experiments. It orchestrates training, evaluation, misclassification detection, OOD detection, and distribution shift detection.
* **datasets.py:** Dataset downloading, loading, and preprocessing utilities.
* **models.py:** Model architecture definitions.
* **train.py:** Training and testing routines.
* **uq.py:** Uncertainty quantification utilities, including uncertainty measures, confidence calibration (misclassification detection), OOD detection, and distribution shift detection. 



##### \[Usage] 

To run the experiments, execute the corresponding main script with the desired arguments.

###### **Example Commands**

* [1] Classical setting: `python main.py`
* [2] Long-tailed setting: `python main.py --imbalance_factor 0.01` (or `0.1`)
* [3] Long-tailed and noisy setting: `python main.py --ID_dataset "MNIST"`


**Note**: For each experimental setting, hyperparameter configurations should be set according to those reported in the paper to reproduce the best-performing results.  



##### \[Environment Details] 

- **GPU**: NVIDIA GeForce RTX 4060
- **Python**: 3.13.11
- **PyTorch**: 2.9.1+cu126
- **Torchvision**: 0.24.1+cu126
- **NumPy**: 2.3.5
- **TQDM**: 4.67.1




##### \[Notes] 

1\. **Dataset availability**: Some dataset files (e.g., CIFAR-10-C) have been intentionally excluded due to size constraints. To fully reproduce the experiments, please download the required datasets and store them in the designated directory specified in the code. 

2\. **Error handling**: While we did not encounter any issues during experimentation, unintended errors might occur depending on the environment or settings. Please verify the dependencies and settings to ensure smooth execution.



##### \[Conclusion] 

We hope this code allows reviewers to seamlessly reproduce our experiments and evaluate the potential of MoDEX.

We appreciate your time, effort, and valuable feedback in reviewing our work. Thank you!

