# Hy-Q: Hybrid Q-learning that makes online RL efficient with offline dataset
This repository contains our FQI-style algorithm for the comblock environment (environment adapted from BRIEE(https://arxiv.org/abs/2202.00063)).

## Run our code

To reproduce our result in comblock with epsilon offline dataset, please run:
```bash
python online_main_lock.py --seed [seed]
```

Please use seed [1,12,123,1234,12345] to reproduce our results. The generation of offline data is included in the pipeline.

