The supplementary video "supp.mp4" contains a brief explanation of our proposed framework, examples of the human videos used for training, and rollouts of our trained policies.