# K-Step Lookahead Thresholding

Achieve fast convergence to K-step lookahead thresholding policy to achieve superior finite sample performance in non-episodic, finite horizon RL.

## Installation
```bash
conda env create -f environment.yml
```
## Running Experiment
```bash
python run_experiment_name.py
```
## Test Different Policies
All available policies are under policies folder. 

