# Approximate Dynamic Programming Framework

This repository contains implementations of Approximate Dynamic Programming methods, as the Tabular QL and ICQL algorithms. The code is organized to facilitate experimentation and reproducibility.

## Repository Structure

- **algos/**: Contains the implementations of the general ADP algorithm.
- **experiment_manager.py**: Manages the setup and execution of experiments.
- **experiment_executor.py**: Handles the execution of experiments as defined by the `experiment_manager.py`.
- **experiment_manager_parameter_guide.md**: Provides detailed information about the parameters that can be configured in `experiment_manager.py`.

## Installation

Ensure you have Python 3.11.3 installed. It is recommended to use [pyenv](https://github.com/pyenv/pyenv) to manage your Python version.

Create a virtual environment:

```
python -m venv venv
```

Activate the virtual environment and install the required dependencies:

```
pip install --upgrade pip setuptools
pip install -r requirements.txt
```

## Experimentation

To run and plot experiments, you can use the `run_experiments.sh` script, which internally calls `experiment_manager.py`. The structure and configurable parameters for `experiment_manager.py` are detailed in `experiment_manager_parameter_guide.md`. 

You can modify the experiment settings either by adjusting the flags in the `run_experiments.sh` script or by directly editing the input parameters in `experiment_manager.py`.

Example configurations for QL with fixed TUI, QL with increasing TUI and QL with early stopping are given in `experiment_manager.py` 

## Final Notes

For further details about the algorithms and their implementation, please refer to the code in the `algos/` folder and the documentation provided in this repository.