<!DOCTYPE html>
<html>
    <head>
        <title> CoRe </title>
    </head>
    <body>
        <h1> CoRe agents running in various environments</h1>
        All samples are from five validation runs after 1M steps of training (unless mentioned otherwise).
        <h2> Distracting Control Suite </h2>
        <table>
            <tr>
                <th> Walker Easy </th>
                <th> <video width="420" height="84" controls> <source src="videos/CoRe-dynamic-easy-walker.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
                <td> In this setting, the agent can learn to control the walker within 1M steps. </td>
            </tr>
            <tr>
                <th> Walker Medium </th>
                <th> <video width="420" height="84" controls> <source src="videos/CoRe-dynamic-medium-walker.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> <br/> </th>
                <td> The agent works reasonably well but occasional failures happen.</td>
            </tr>
            <tr>
                <th> Walker Hard </th>
                <th> <video width="420" height="84" controls> <source src="videos/CoRe-dynamic-hard-walker.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
                <td> Failures happen frequently at 1M steps of training.</td>
            </tr>
            <tr>
                <th> Walker Hard at 2M steps </th>
                <th> <video width="420" height="84" controls> <source src="videos/CoRe-dynamic-hard-walker-2M.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
                <td> Failures almost gone at 2M steps of training. </td>
            </tr>
            <tr>
                <th> Cartpole Easy </th>
                <th> <video width="420" height="84" controls> <source src="videos/CoRe-dynamic-easy-cartpole.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
                <td> Learns to balance the pole with occasional failures. </td>
            </tr>
            <tr>
                <th> Cartpole Hard </th>
                <th> <video width="420" height="84" controls> <source src="videos/CoRe-dynamic-hard-cartpole.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
                <td> Cannot solve the task at 1M steps. </td>
            </tr>
            <tr>
                <th> Cartpole Hard at 2M steps </th>
                <th> <video width="420" height="84" controls> <source src="videos/CoRe-dynamic-hard-cartpole-2M.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
                <td> Success happens but failure still common at 2M steps. </td>
            </tr>
        </table>
        <h2> Robosuite Door Opening </h2>

        <table>
            <tr>
                <th> Panda static </th>
                <th> <video width="640" height="128" controls> <source src="videos/CoRe-static-camera-color-lighting-Panda-Door-2.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
            </tr>
            <tr>
                <th> Panda dynamic </th>
                <th> <video width="640" height="128" controls> <source src="videos/CoRe-dynamic-camera-color-lighting-Panda-Door.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> <br/> </th>
            </tr>
            <tr>
                <th> Jaco static </th>
                <th> <video width="640" height="128" controls> <source src="videos/CoRe-static-camera-color-lighting-Jaco-Door.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
            </tr>
        </table>

        <h1> Visualization using gating masks </h1>
        <table>
            <tr>
                <th> Cheetah </th>
                <th> <video width="840" height="168" controls> <source src="videos/CoRe-dynamic-hard-cheetah-gating.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
            </tr>
            <tr>
                <th> Reacher </th>
                <th> <video width="840" height="168" controls> <source src="videos/CoRe-dynamic-hard-reacher-gating.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
            </tr>
            <tr>
                <th> Finger </th>
                <th> <video width="840" height="168" controls> <source src="videos/CoRe-dynamic-hard-finger-gating.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
            </tr>
            <tr>
                <th> Door Opening </th>
                <th> <video width="640" height="256" controls> <source src="videos/CoRe-robosuite-gating.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
            </tr>
        </table>
        Such masks are not obtained when we use the same gating-enabled encoders with baseline SAC+RAD model.<br/>
        <video width="640" height="256" controls> <source src="videos/SAC-dynamic-hard-background-only-cheetah.mp4" type="video/mp4"> Your browser does not support the video tag.  </video>

        <h1> Long-term predictions </h1>
        Since our model avoids the use of reconstruction during training, we do not have a pixel decoder to help visualize the model's long-term predictions. However, we can obtain such a decoder just for probing the latent space using parallel distraction-free versions of the observations. We train a pixel decoder to predict the distraction-free observations from the model's <b> detached </b> contrastive prediction obtained by applying a stop-gradient operation on it. <br/>

        <b> Top </b>: the agent's observations. <b> Middle </b>: the
        predicted next step observation from the model's latent state using the
        trained probe. <b> Bottom </b> the decoded observations from the
        obtained by rolling out the model's latent state in imagination.
        The imagination rollout starts after 50 time steps (when the green
        border switches to blue).
        
        <table>
            <tr>
                <th> Cheetah </th>
                <th> Reacher </th>
                <th> Walker </th>
            </tr>
            <tr>
                <th> <video width="420" height="252" controls> <source src="videos/longterm-cheetah.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
                <th> <video width="420" height="252" controls> <source src="videos/longterm-reacher.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
                <th> <video width="420" height="252" controls> <source src="videos/longterm-walker.mp4" type="video/mp4"> Your browser does not support the video tag.  </video> </th>
            </tr>
        </table>
    </body>
</html>
