The folder `human-like_driving_model` contains demonstrations of rollout trajectories generated with the prior policy and with CriticSMC. Each GIF also shows a contour plot of the expected future reward likelihood predicted by the learned critic model. Brighter regions indicate higher likelihood. The black dots are a sample of actions generated by the prior model, and the white dot is the action picked by CriticSMC.
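To help read the plots, here is a minimal sketch of how one candidate action could be chosen from a set of prior samples using a critic score. This is not the code used to produce the GIFs. It is a simplified illustration in which the Gaussian prior, the two-dimensional action space, and the names `pick_action` and `toy_log_critic` are all hypothetical stand-ins.

```python
import numpy as np


def pick_action(prior_actions, log_critic, rng):
    """Pick one candidate with probability proportional to exp(critic score).

    Simplified illustration only; not the repository's CriticSMC code.
    """
    log_w = log_critic(prior_actions)
    log_w = log_w - log_w.max()      # subtract the max for numerical stability
    weights = np.exp(log_w)
    weights = weights / weights.sum()  # normalise to a probability vector
    idx = rng.choice(len(prior_actions), p=weights)
    return prior_actions[idx], weights


def toy_log_critic(actions):
    """Hypothetical critic: log-likelihood peaks at the action (0.5, 0.0)."""
    target = np.array([0.5, 0.0])
    return -4.0 * np.sum((actions - target) ** 2, axis=1)


rng = np.random.default_rng(0)

# Hypothetical prior samples (the black dots in the plots).
prior_actions = rng.normal(loc=0.0, scale=1.0, size=(64, 2))

# The selected action (the white dot in the plots).
chosen, weights = pick_action(prior_actions, toy_log_critic, rng)
print("chosen action:", chosen)
```

In this sketch, candidates that fall in brighter regions of the critic's contour receive larger weights, so the selected action tends to land there.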

The folder `toy_environment` contains demonstrations of rollout trajectories on the toy environment using CriticSMC.