# Robust Linear Dueling Bandits with Post-serving Contexts

This repository contains the official implementation of the algorithm **RCDP-UCB** (Robust to Corruption, Delay, and Post-serving UCB) and the experimental scripts to reproduce the results presented in the paper.

## Prerequisites

- Python 3.8+
- PyTorch
- NumPy
- Matplotlib
- Pandas
- Scikit-learn

You can install the dependencies using:
```bash
pip install -r requirements.txt
```

## Structure

- `contextual_dueling_bandit.py`: Core implementation of the environment and algorithms.
- `main.py`: Main script to reproduce the robustness experiments (Figure 1 & 2).

## Reproducing Experiments

To reproduce the main results:
```bash
python main.py
```

## License

MIT License
