# Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies
Code for training Kernel Metric Learning for In-sample Fitted Q Evaluation (KMIFQE) 


### Train KMIFQE to get the results on Hopper-v2 which is plotted in Figure 3: 
```
python main.py
``` 