# README

The video file contains clips of the trained policies in the Fetch environments. It entails behaviors learned in the second phase of pretraining, i.e. when the agent is reinforced to produce behaviors optimizing the junction state measure $Door(s)$. In the upper left corner of each clip is the name of the environment, together with the step number and the $Door(s)$ reward at the step shown. The behaviors shown are reused in the downstream learning phase. Note that the clips are a result of multiple pretraining experimental runs.
