<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="UTF-8">
  <meta name="viewport" content="width=device-width, initial-scale=1.0">
  <title>Generative Perception of Shape and Material from Differential Motion</title>
  <!-- Stylesheets -->
  <link rel="stylesheet" href="css/bootstrap.min.css">
  <link rel="stylesheet" href="css/app.css">
  <!-- Optional: Font Awesome -->
  <link rel="stylesheet" href="css/font-awesome.min.css">
</head>
<body>
  <div class="container text-center" id="header">
    <h1 id="paper-title">Generative Perception of Shape and Material from Differential Motion</h1>
    <p id="authors">NeurIPS 2025 Submission #842 Supplementary Materials (Visualizations) </p>
  </div>

  <!-- Results Visualization -->
  <div class="container" id="results-visualization">
    <h2>Results Visualizations</h2>
      <div class="text-justify">
      We visualize the input video sequence, predicted surface normal and albedo (diffuse color) from a moving object.<br><br>
      (1) Objects with synthetic textures:<br><br>
    </div>
    
    <!-- Row of 3 videos -->
    <div class="row">
      <div class="col-md-4 video-wrapper text-center">
        <video class="video" id="e1" loop playsinline autoPlay muted src="video/cloud.mp4" onplay="resizeAndPlay(this)"></video>
      </div>
      <div class="col-md-4 video-wrapper text-center">
        <video class="video" id="e2" loop playsinline autoPlay muted src="video/piggy.mp4" onplay="resizeAndPlay(this)"></video>
      </div>
      <div class="col-md-4 video-wrapper text-center">
        <video class="video" id="e3" loop playsinline autoPlay muted src="video/plant.mp4" onplay="resizeAndPlay(this)"></video>
      </div>
    </div>

    <!-- Add more <div class="row"> blocks if you have additional videos -->
        <!-- Row of 3 videos -->
      <div class="text-justify">
      <br> (2) Objects with original, artists' designed textures:<br><br>
    </div>
    <div class="row">
      <div class="col-md-4 video-wrapper text-center">
        <video class="video" id="e4" loop playsinline autoPlay muted src="video/honeyjar.mp4" onplay="resizeAndPlay(this)"></video>
      </div>
      <div class="col-md-4 video-wrapper text-center">
        <video class="video" id="e5" loop playsinline autoPlay muted src="video/cookies.mp4" onplay="resizeAndPlay(this)"></video>
      </div>
      <div class="col-md-4 video-wrapper text-center">
        <video class="video" id="e6" loop playsinline autoPlay muted src="video/spoon.mp4" onplay="resizeAndPlay(this)"></video>
      </div>
    </div>
    <!-- Add more <div class="row"> blocks if you have additional videos -->
  </div>
  <div class="container" id="video-section">
    <h2>Motion Disambiguation - Perception Test</h2>
    <!-- Video Placeholder 1 -->
    <div class="text-justify">
    Visualization of the shiny/matte perception test designed by Hartung and Kersten. Note in the croissant shape example the static frames can look similar for both the shiny and diffuse material. Motion cues help to disambiguate the materials.<br><br>
    </div>
<div class="row">
  <div class="col-md-4 video-wrapper text-center">
    <figure style="margin:0;">
      <video
        class="video" id="p1" loop playsinline autoplay muted src="video/pot_shiny.mp4" onplay="resizeAndPlay(this)"
        style="height:120px; width:auto; object-fit:contain; display:block;"></video>
      <figcaption style="margin-top:.5rem; font-size:2rem; color:#666;">
        Pot (shiny)
      </figcaption>
    </figure>
  </div>
  <div class="col-md-4 video-wrapper text-center">
    <figure style="margin:0;">
      <video
        class="video" id="p2" loop playsinline autoplay muted src="video/croissant_shiny.mp4" onplay="resizeAndPlay(this)"
        style="height:120px; width:auto; object-fit:contain; display:block;"></video>
      <figcaption style="margin-top:.5rem; font-size:2rem; color:#666;">
        Croissant (shiny)
      </figcaption>
    </figure>
  </div>
  <div class="col-md-4 video-wrapper text-center">
    <figure style="margin:0;">
      <video
        class="video" id="p3" loop playsinline autoplay muted src="video/croissant_diffuse.mp4" onplay="resizeAndPlay(this)"
        style="height:120px; width:auto; object-fit:contain; display:block;"
      ></video>
      <figcaption style="margin-top:.5rem; font-size:2rem; color:#666;">
        Croissant (diffuse)
      </figcaption>
    </figure>
  </div>
</div>
</div>

  <div class="container" id="video-section">
    <h2>Extending to Longer Observations</h2>
    <!-- Video Placeholder 1 -->
    <div class="text-justify">
    We apply temporal consistency guidance on the predictions of the overlapping frame to extend our model to longer video sequence (13 frames instead of the default 3 frames). This guided sampling method allows temporally consistent shape and material estimation. We compare with the shape prediction result from StableNormal, using their official video mode demo.<br><br>
    </div>

  <div class="row" style="display:flex; gap:1rem;">
    <!-- 1st video: flex 3 -->
    <div class="col-md-4 video-wrapper text-center" style="flex:3;">
      <figure style="margin:0;">
        <video
          class="video"
          id="l1"
          loop
          playsinline
          autoplay
          muted
          src="video/long_dance_fps10.mp4"
          onplay="resizeAndPlay(this)"
          style="height:120px; width:auto; object-fit:contain; display:block;"
        ></video>
        <figcaption style="margin-top:.5rem; font-size:2rem; color:#666;">
          Ours
        </figcaption>
      </figure>
    </div>

    <!-- 2nd video: flex 1 -->
    <div class="col-md-4 video-wrapper text-center" style="flex:1;">
      <figure style="margin:0;">
        <video
          class="video"
          id="l2"
          loop
          playsinline
          autoplay
          muted
          src="video/sn_dance.mp4"
          onplay="resizeAndPlay(this)"
          style="height:120px; width:auto; object-fit:contain; display:block;"
        ></video>
        <figcaption style="margin-top:.5rem; font-size:2rem; color:#666;">
          StableNormal
        </figcaption>
      </figure>
    </div>

    <!-- 3rd video: flex 3 -->
    <div class="col-md-4 video-wrapper text-center" style="flex:3;">
      <figure style="margin:0;">
        <video
          class="video"
          id="l3"
          loop
          playsinline
          autoplay
          muted
          src="video/long_sphere_fps10.mp4"
          onplay="resizeAndPlay(this)"
          style="height:120px; width:auto; object-fit:contain; display:block;"
        ></video>
        <figcaption style="margin-top:.5rem; font-size:2rem; color:#666;">
          Ours
        </figcaption>
      </figure>
    </div>

    <!-- 4th video: flex 1 -->
    <div class="col-md-4 video-wrapper text-center" style="flex:1;">
      <figure style="margin:0;">
        <video
          class="video"
          id="l4"
          loop
          playsinline
          autoplay
          muted
          src="video/sn_sphere.mp4"
          onplay="resizeAndPlay(this)"
          style="height:120px; width:auto; object-fit:contain; display:block;"
        ></video>
        <figcaption style="margin-top:.5rem; font-size:2rem; color:#666;">
          StableNormal
        </figcaption>
      </figure>
    </div>
  </div>
</div>

    <!-- Add more video blocks as needed -->
  </div>

  <!-- Optional: Scripts -->
  <script src="js/jquery.min.js"></script>
  <script src="js/bootstrap.min.js"></script>
  <script src="js/app.js"></script>
</body>
</html>

