<!DOCTYPE html>
<html lang="en" style="scroll-behavior: smooth;">
<head>
    <meta charset="UTF-8">
    <link rel="stylesheet" href="Media/Fonts/inter.css">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>Online World Modeling for Real-World IRLfO</title>
    <link rel="stylesheet" href="styles.css">
    <style>.full-video-link { display: none !important; }</style>
    <script src="script.js"></script>
</head>
<body>
    
    <section class="video-section">
        <div class="video-container" id="hero-video-container">
            <img class="hero-poster" src="Media/Video/Background/poster.jpg" alt="" aria-hidden="true">
            <video id="hero-video" autoplay muted loop playsinline preload="metadata">
                <source src="Media/Video/Background/background.mp4" type="video/mp4">
                Your browser does not support video playback.
            </video>
        </div>
        <div class="video-overlay">
            <h1>Online World Modeling for Real-World IRLfO</h1>
            <div class="hero-links">
                
                
                
            </div>
            <!-- Authors Section -->
            <div id="authors-section" class="authors-section authors-section--always-visible">
                <div class="authors-list">
                    <span style="color:inherit;font-weight:500;">Anonymous Authors</span>
                </div>
                <div class="authors-affiliation"></div>
            </div>
        </div>
    </section>

    
    <nav class="timeline-nav" id="timeline-nav">
        <div class="timeline-line"></div>
        <a href="#summary-section" class="timeline-item active" data-section="summary">
            <span class="timeline-dot"></span>
            <span class="timeline-label">Overview</span>
        </a>
        <a href="#method-section" class="timeline-item" data-section="method">
            <span class="timeline-dot"></span>
            <span class="timeline-label">Method</span>
        </a>
        <a href="#efficiency-section" class="timeline-item" data-section="results">
            <span class="timeline-dot"></span>
            <span class="timeline-label">Results</span>
        </a>
        <a href="#generalization-section" class="timeline-item" data-section="generalization">
            <span class="timeline-dot"></span>
            <span class="timeline-label">Generalization</span>
        </a>
    </nav>

    
    <div class="task-buttons-fixed" id="task-buttons-left">
        <div class="task-btn-wrapper active" data-task="push">
            <div class="task-icon push-icon">
                <div class="icon-block"></div>
                <div class="icon-block-dest"></div>
                <div class="icon-arrow">→</div>
            </div>
            <button class="task-btn active" data-task="push" onclick="selectTask('push')">Push</button>
        </div>
        <div class="task-btn-wrapper" data-task="pick-place">
            <div class="task-icon pick-icon">
                <div class="pick-arrow arrow-up">↑</div>
                <div class="pick-arrow arrow-right">→</div>
                <div class="pick-arrow arrow-down">↓</div>
                <div class="icon-block"></div>
                <div class="icon-block-dest"></div>
            </div>
            <button class="task-btn" data-task="pick-place" onclick="selectTask('pick-place')">Pick and Place</button>
        </div>
    </div>

    
    <div class="container">
        
        <div id="summary-section-container">
<section class="project-info scroll-section" id="summary-section">
    <div class="project-info-inner">
        <h2 class="section-header">
            Observational Learning for Robots
            <span class="section-subtitle">
                Animals and humans often learn by observing tasks being performed by others.
                We can then learn to perform the task ourselves according to what we saw. 
                If robots could learn this way, they would no longer require expert supervision from hand-designed
                <span class="term term--bold" data-tooltip="Mathematical reward definitions can be challenging to design.

                For example, specifying pushing a block to a location with only camera images may require complex computer vision algorithms to first identify then reward.">rewards</span>
                or
                <span class="term term--bold" data-tooltip="Remotely controlling a robot via human operator. This can be challenging for complex robots (like humanoids or tactile hands) or for non-expert users!">tele-operation</span>.
            </span>
        </h2>
        <p>
        </p>
        <br>
        <!-- </div> -->
        <!-- <div class="highlight-box">
            We introduce <strong>MPAIL2</strong> for real-world observational learning on robots
        </div> -->
        <div class="teaser-video-wrapper">
            <div class="video-display teaser-video" style="overflow: hidden;">
                <video autoplay muted loop playsinline style="width: 100%; height: 100%; object-fit: cover;">
                    <source src="Media/Video/teaser.mp4" type="video/mp4">
                </video>
            </div>
        </div>
        <br>
        <!-- <h2>Model Predictive Adversarial Imitation Learning 2 (MPAIL2)</h2> -->
        <h2>Key Results</h2>
        <div class="highlight-box-group">
            <div class="highlight-box highlight-box--emerald">
                <strong>A First in Real-World Observational Learning</strong>
                <br>
                This work demonstrates the first work in observational learning 
                (<span class="term term--bold" data-tooltip-html="Inverse Reinforcement Learning from Observation. IRLfO is a type of inverse reinforcement learning that learns a reward function from observations of a task being performed by an expert.
                <br>
                <br>
                <i>While other observational learning frameworks exist, they do not readily admit improvement with experience.</i>
                ">IRLfO</span>)
                <span class="term term--bold" data-tooltip-html="Achieved without offline pre-training, prior data, models, or simulations. All learning is done in the real-world with model weights initialized randomly.
                <br>
                <br>
                While it is possible to pre-train MPAIL2, this work aims to remain as general as possible. For instance, non-manipulation settings often do not have readily available pre-trained policy models.
                "
                >purely in the real-world from scratch</span>.
            </div>
            <div class="highlight-box highlight-box--emerald">
                <strong>A First in Transfer Learning</strong>
                <br>
                This work is the first to demonstrate
                <span class="term term--bold" data-tooltip-html="Currently one of the primary differences between man and machine, Transfer learning is the process of transferring knowledge from one task to another.
                </i>
                ">transfer learning</span> from scratch in the real-world.
            </div>
            <div class="highlight-box highlight-box--orange">
                <strong>Real-World Sample Efficiency</strong>
                <br>
                Where a state-of-the-art baseline in RL with demonstrations (<span class="term term--bold" data-tooltip-html="<a href='https://proceedings.mlr.press/v202/ball23a/ball23a.pdf' target='_blank'>Reinforcement Learning with Prior Data (RLPD)</a>, Ball et al. 2023.
                ">RLPD</span>) sees no success after over an hour of real-world training, MPAIL2 sees consistent success in under 40 minutes.
            </div>
        </div>
    </div>
</section>
</div>
        
        <div id="method-section-container">
<section class="project-info scroll-section" id="method-section">
    <div class="project-info-inner">
        <h2 class="section-header">The Algorithm - MPAIL2
            <span class="section-subtitle">
                We introduce Model Predictive Adversarial Imitation Learning 2 (MPAIL2), an
                <span class="term term--bold" data-tooltip-html="Inverse Reinforcement Learning (IRL) learns a reward function in addition to a policy using demonstration data.
                ">IRL</span> algorithm
                that learns a
                <span class="term term--bold" data-tooltip-html="A world model predicts the next state of the world based on the current state and action.
                ">world model</span>
                and performs
                <span class="term term--bold" data-tooltip-html="Rather directly predicting the learner's observations (e.g. camera images), latent planning operates on reduced, low-dimensional representations of the world, enabling reactive planning and more efficient learning.
                ">latent planning</span>.
            </span>
        </h2>
        <div style="margin: 20px auto 0 auto;">
            <div class="summary-main-image" style="margin-bottom: 0;"><img src="Media/Image/mpail2.drawio.png" alt="Method Overview"></div>
        </div>
        <br>
        <div class="card-grid">
            <div class="info-card">
                <p><strong>1.</strong>&nbsp;Five (5) components comprise MPAIL2's world model:
                encoder, dynamics, reward, value, and policy.
                Online, a planner uses these components to sample plans, predict their outcomes, evaluate them, and finally select a plan.</p>
                <p><strong>2.</strong>&nbsp;The learner's task is given through observations of task demonstrations, like watching a person cook. They are used only when learning the reward.
                <br>
                <br>
                <i>
                See this in action on the right. Notice that the block's predicted trajectory (orange) goes from static to dynamic as the robot (green) plans through it.
                </i>
            </div>
            <div class="card-outline">
                <div class="summary-main-image" style="flex: 1; margin-bottom: 0;"><img src="Media/Video/Vis/overlay.gif" alt="Method Overview"></div>
                <video autoplay muted loop playsinline style="width: 100%; border-radius: 4px; display: block; margin-top: 12px;">
                    <source src="Media/Video/Vis/rollout-vis.mp4" type="video/mp4">
                </video>
                <i>
                    <small>
                        Note that planning occurs in latent space. This visualization is made possible by training a separate decoder.
                    </small>
                </i>
            </div>
        </div>
    </div>
</section>
</div>

        <div id="efficiency-section-container">
        <section class="tasks-section scroll-section" id="efficiency-section">
            <h2 class="section-header" id="experiments-section">Results
                <span class="section-subtitle">
                    All experiments are trained <strong>entirely from scratch in the real world</strong> with <strong>no pre-training or prior data of any kind</strong> (beyond task observations). Behavior Cloning via Diffusion Policy is also evaluated and its results can be viewed in the evaluation tables as it is an offline algorithm.
                    <br>
                </span>
            </h2>
            <h2 class="section-header experiment-sub-header">Sample Efficiency
                <span class="section-subtitle">Watch the robot train from scratch in the real world using various methods. RL has been notoriously inefficient at real-world training for various reasons.</span>
            </h2>

            <div class="content-block" id="demonstrations-block">
                <h3 class="subsection-header">Demonstrations</h3>
                <div class="demonstrations-grid" id="demonstrations-grid">
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_1.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_2.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_3.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_4.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_5.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_6.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_7.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_8.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_9.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_10.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                </div>
                <div class="demonstrations-caption">Only these <span style="font-weight:700; font-size:1.15em; color:#667eea;">10</span> Demonstrations are provided to the learner.</div>
            </div>

            
            <div class="content-block" id="time-lapses-block">
                <h3 class="subsection-header">Training Time-lapse</h3>

                
                <div class="iteration-control-area">
                    <div class="iteration-info-row">
                        <div class="iteration-info-item">
                            <label>Iteration:</label>
                            <span class="value" id="iteration-display">0</span>
                        </div>
                        <div class="iteration-info-item">
                            <label>Time Spent:</label>
                            <span class="value" id="time-spent">0</span>
                        </div>
                    </div>
                    <div class="slider-track-container">
                        <input
                            type="range"
                            class="iteration-slider"
                            id="iteration-slider" 
                            min="0"
                            max="100"
                            value="0"
                            step="10"
                            oninput="updateIteration(this.value)"
                        >
                        <div class="slider-dots" id="slider-dots"></div>
                        <div class="slider-checkmarks" id="slider-checkmarks"></div>
                        <div class="slider-hint">Tip: drag to scrub through training iterations.</div>
                    </div>
                </div>

                <div class="transfer-videos-container">
                    <div class="transfer-video-wrapper">
                        <p class="video-label highlighted sample-mpail2-title" style="margin-bottom: 10px;">MPAIL2</p>
                        <div class="video-display">
                            <video id="task-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Push/iter_0.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                        <a class="full-video-link" style="margin-top: 20px;" onclick="openVideoModal('sample')">
                            <svg viewBox="0 0 24 24"><path d="M8 5v14l11-7z"/></svg>
                            Full Training Video
                        </a>
                    </div>
                    <div class="transfer-video-wrapper">
                        <div style="display: flex; align-items: center; justify-content: center; gap: 12px; margin-bottom: 10px; flex-wrap: nowrap;">
                            <div style="white-space: nowrap;">
                                <strong>Compare:</strong>
                            </div>
                            <div class="sample-method-row" style="justify-content: center; margin-bottom: 0; flex-wrap: nowrap;">
                                <button
                                    type="button"
                                    class="sample-method-btn active"
                                    data-method="mpail2-p"
                                    onclick="selectSampleMethod('mpail2-p')"
                                    data-tooltip-html="MPAIL2 without online planning.<br>Uses the learned explicit policy only, with no lookahead planning at test time. Similar to <a href='https://ieeexplore.ieee.org/document/9361118' target='_blank'>Model-based AIRL (MAIRL)</a>, Sun et al. 2021"
                                >
                                    Without Planning
                                </button>
                                <button
                                    type="button"
                                    class="sample-method-btn"
                                    data-method="mpail2-pm"
                                    onclick="selectSampleMethod('mpail2-pm')"
                                    data-tooltip-html="MPAIL2 Implementation without dynamics model usage (e.g. Q-targets, policy optimization). Similar to <a href='https://arxiv.org/abs/1809.02925' target='_blank'>DAC</a>, Kostrikov et al. 2018"
                                >
                                    Model-Free
                                </button>
                                <button
                                    type="button"
                                    class="sample-method-btn"
                                    data-method="rlpd"
                                    onclick="selectSampleMethod('rlpd')"
                                    data-tooltip-html=" <a href='https://proceedings.mlr.press/v202/ball23a/ball23a.pdf' target='_blank'>Reinforcement Learning with Prior Data (RLPD)</a>, Ball et al. 2023<br>State-of-the-art model-free baseline that leverages demonstration data."
                                >
                                    RLPD
                                </button>
                            </div>
                        </div>
                        <div class="video-display">
                            <video id="sample-method-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Comparison/mpail2_p/Push/iter_0.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                        <p class="video-label" id="sample-method-label" style="display:none;">Without Planning</p>
                        <a class="full-video-link" style="margin-top: 20px;" onclick="openVideoModal('comparison')">
                            <svg viewBox="0 0 24 24"><path d="M8 5v14l11-7z"/></svg>
                            Full Training Video
                        </a>
                    </div>
                </div>

            <div class="content-block" id="results-block">
                <div class="results-panel">
                    <h3 class="subsection-header">Plots and Tables</h3>
                    <div class="collapsible-body">
                        <div style="display: flex; gap: 10px; justify-content: center; align-items: center; max-width: 1200px; margin: 5px auto 0 auto;" id="results-images">
                            <div style="flex: 1; max-width: 550px;">
                                <img id="results-chart-img" src="Media/Image/Results/Push_chart.png" alt="Results Chart" style="width: 100%; border-radius: 8px;">
                            </div>
                            <div style="flex: 1; max-width: 550px;">
                                <img id="results-table-img" src="Media/Image/Results/Push_table.png" alt="Results Table" style="width: 100%; border-radius: 8px;">
                            </div>
                        </div>
                    </div>
                </div>
            </div>
            </div>
        </section>
</div>
        <div id="transfer-section-container">
        <section class="tasks-section scroll-section" id="transfer-section">
            <h2 class="section-header experiment-sub-header">Transfer Learning
                <span class="section-subtitle">Robot is trained using an initial set of demonstrations for one task. Then, the model weights are directly transferred for training on a new set of demonstrations for a different task. Since the tasks are similar, we expect the robot to learn new skills quickly by relying on its continual experience. However, this has not been true of most RL methods without specialized methods or massive data coverage.</span>
            </h2>

            <div class="content-block" id="transfer-initial-demonstrations-block">
                <h3 class="subsection-header">Demonstrations</h3>
                <div class="transfer-demo-toggle">
                    <button type="button" class="sample-method-btn transfer-demo-btn active" data-transfer-demo="initial" onclick="switchTransferDemo('initial')">Initial Task</button>
                    <button type="button" class="sample-method-btn transfer-demo-btn" data-transfer-demo="new" onclick="switchTransferDemo('new')">New Task</button>
                </div>
                <div class="transfer-demo-panel" id="transfer-demo-panel-initial">
                <div class="demonstrations-grid" id="initial-demonstrations-grid">
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="initial-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_1.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="initial-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_2.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="initial-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_3.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="initial-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_4.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="initial-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_5.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="initial-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_6.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="initial-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_7.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="initial-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_8.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="initial-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_9.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="initial-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Push/demo_10.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                </div>
                <div class="demonstrations-caption">Only these <span style="font-weight:700; font-size:1.15em; color:#667eea;">10</span> Demonstrations are provided to the learner.</div>
                </div>
                <div class="transfer-demo-panel hidden" id="transfer-demo-panel-new">
                <div class="demonstrations-grid" id="transfer-demonstrations-grid">
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="transfer-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Transfer/Push/demo_1.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="transfer-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Transfer/Push/demo_2.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="transfer-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Transfer/Push/demo_3.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="transfer-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Transfer/Push/demo_4.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="transfer-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Transfer/Push/demo_5.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="transfer-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Transfer/Push/demo_6.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="transfer-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Transfer/Push/demo_7.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="transfer-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Transfer/Push/demo_8.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="transfer-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Transfer/Push/demo_9.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                    <div class="demo-video-wrapper">
                        <div class="video-display">
                            <video class="transfer-demo-video" autoplay muted loop playsinline>
                                <source src="Media/Video/Demonstrations/Transfer/Push/demo_10.mp4" type="video/mp4">
                            </video>
                        </div>
                    </div>
                </div>
                <div class="demonstrations-caption">Only these <span style="font-weight:700; font-size:1.15em; color:#667eea;">10</span> Demonstrations are provided to the learner.</div>
                </div>
            </div>

            
            <div class="content-block" id="transfer-time-lapses-block">
                <h3 class="subsection-header">Training Time-lapse</h3>

                
                <div class="iteration-control-area">
                    <div class="iteration-info-row">
                        <div class="iteration-info-item">
                            <label>Iteration:</label>
                            <span class="value" id="transfer-iteration-display">0</span>
                        </div>
                        <div class="iteration-info-item">
                            <label>Time Spent:</label>
                            <span class="value" id="transfer-time-spent">0</span>
                        </div>
                    </div>
                    <div class="slider-track-container">
                        <input
                            type="range"
                            class="iteration-slider"
                            id="transfer-iteration-slider" 
                            min="0"
                            max="100"
                            value="0"
                            step="10"
                            oninput="updateTransferIteration(this.value)"
                        >
                        <div class="slider-dots" id="transfer-slider-dots"></div>
                        <div class="slider-checkmarks" id="transfer-slider-checkmarks"></div>
                        <div class="slider-hint slider-hint--transfer">Tip: drag to compare transfer vs. from-scratch training.</div>
                    </div>
                </div>

                
                <div class="transfer-videos-container">
                    <div class="transfer-video-wrapper transferred">
                        <p class="video-label highlighted sample-mpail2-title" style="margin-bottom: 10px;">Transferred</p>
                        <div class="video-display">
                            <video id="transfer-video-left" autoplay muted loop playsinline>
                                <source src="Media/Video/Transfer/Push/iter_0.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                        
                        <a class="full-video-link" style="margin-top: 20px;" onclick="openVideoModal('transfer')">
                            <svg viewBox="0 0 24 24"><path d="M8 5v14l11-7z"/></svg>
                            Full Training Video
                        </a>
                    </div>
                    <div class="transfer-video-wrapper">
                        <p class="video-label" style="margin-bottom: 10px;">From Scratch</p>
                        <div class="video-display">
                            <video id="transfer-video-right" autoplay muted loop playsinline>
                                <source src="Media/Video/From%20Scratch/Push/iter_0.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                        <a href="#" class="full-video-link" style="margin-top: 20px;" onclick="openVideoModal('scratch'); return false;">
                            <svg viewBox="0 0 24 24"><path d="M8 5v14l11-7z"/></svg>
                            Full Training Video
                        </a>
                    </div>
                </div>
                <div class="content-block" id="transfer-results-block">
                    <div class="results-panel">
                        <h3 class="subsection-header">Plots and Tables</h3>
                        <div class="collapsible-body">
                            <div style="display: flex; gap: 10px; justify-content: center; align-items: center; max-width: 1200px; margin: 5px auto 0 auto;" id="transfer-results-images">
                                <div style="flex: 1; max-width: 550px;">
                                    <img id="transfer-results-chart-img" src="Media/Image/Results/Transfer_Push_chart.png" alt="Transfer Results Chart" style="width: 100%; border-radius: 8px;">
                                </div>
                                <div style="flex: 1; max-width: 550px;">
                                    <img id="transfer-results-table-img" src="Media/Image/Results/Transfer_Push_table.png" alt="Transfer Results Table" style="width: 100%; border-radius: 8px;">
                                </div>
                            </div>
                        </div>
                    </div>
                </div>
            </div>
        </section>
</div>
        <div id="generalization-section-container">
        <section class="tasks-section scroll-section" id="generalization-section">
            <h2 class="section-header">Generalization &amp; Minimal Observations
                <span class="section-subtitle">Some interesting videos from when we were playing with the robot.</span>
            </h2>
            <p class="demonstrations-caption">(Trained with <span style="font-weight:700; font-size:1em; color:#e6a817;">only Yellow Cube</span> in scene)</p>

            <div class="content-block">
                <h3 class="subsection-header">Generalization</h3>
                <div class="generalization-grid">
                    <div class="generalization-video-wrapper">
                        <div class="video-display">
                            <video id="gen-video-1" autoplay muted loop playsinline>
                                <source src="Media/Video/Generalization/video_1.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                        <div class="gen-video-label">Novel Cube 1</div>
                    </div>
                    <div class="generalization-video-wrapper">
                        <div class="video-display">
                            <video id="gen-video-2" autoplay muted loop playsinline>
                                <source src="Media/Video/Generalization/video_2.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                        <div class="gen-video-label">Novel Cube 2</div>
                    </div>
                    <div class="generalization-video-wrapper">
                        <div class="video-display">
                            <video id="gen-video-3" autoplay muted loop playsinline>
                                <source src="Media/Video/Generalization/video_3.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                        <div class="gen-video-label">Multi-Cube Scene 1</div>
                    </div>
                    <div class="generalization-video-wrapper">
                        <div class="video-display">
                            <video id="gen-video-4" autoplay muted loop playsinline>
                                <source src="Media/Video/Generalization/video_4.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                        <div class="gen-video-label">Multi-Cube Scene 2</div>
                    </div>
                </div>
                <div style="display: flex; justify-content: center; margin-top: 20px;">
                    <div class="generalization-video-wrapper" style="max-width: calc(50% - 12px);">
                        <div class="video-display">
                            <video id="gen-video-5" autoplay muted loop playsinline>
                                <source src="Media/Video/Generalization/video_5.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                        <div class="gen-video-label">Camera Disturbance</div>
                    </div>
                </div>
            </div>

            <div class="content-block">
                <h3 class="subsection-header">Minimal Observations
                    <span class="section-subtitle">Demonstrations contain <span style="font-weight:700; font-size:1em; color:#667eea;">only the table camera</span>, with no proprioceptive / wrist-mounted camera from the robot in demonstrations using a reward conditioned on a subset of the latent state. Though it takes longer, MPAIL2 can still learn in this setting, pointing towards future work in cross-embodiment training and learning from videos.</span>
                </h3>
                <br>
                <div class="generalization-grid">
                    <div class="generalization-video-wrapper">
                        <div class="video-display">
                            <video id="tablecam-video-1" autoplay muted loop playsinline>
                                <source src="Media/Video/TableCam/video_1.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                    </div>
                    <div class="generalization-video-wrapper">
                        <div class="video-display">
                            <video id="tablecam-video-2" autoplay muted loop playsinline>
                                <source src="Media/Video/TableCam/video_2.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                    </div>
                    <div class="generalization-video-wrapper">
                        <div class="video-display">
                            <video id="tablecam-video-3" autoplay muted loop playsinline>
                                <source src="Media/Video/TableCam/video_3.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                    </div>
                    <div class="generalization-video-wrapper">
                        <div class="video-display">
                            <video id="tablecam-video-4" autoplay muted loop playsinline>
                                <source src="Media/Video/TableCam/video_4.mp4" type="video/mp4">
                                Your browser does not support video playback.
                            </video>
                        </div>
                    </div>
                </div>
                <div class="demonstrations-caption" style="margin-top:20px;">
                    Achieved in ~500 iterations. The ArUco marker on the cube is used solely for trajectory recording, not as part of the observation.
                </div>
            </div>
        </section>
</div>

    </div>
</section>


</div>

    </div>

    
    <div class="video-modal" id="video-modal">
        <div class="modal-backdrop"></div>
        <div class="modal-content">
            <button class="modal-close" onclick="closeVideoModal()">×</button>
            <video class="modal-video" id="modal-video" controls muted>
                <!-- Full video removed for size -->
                Your browser does not support video playback.
            </video>
        </div>
    </div>

</body>
</html>
