<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="UTF-8" />
  <meta name="viewport" content="width=device-width, initial-scale=1.0" />
  <title>Pairwise Comparisons: Foreground Object Consistency</title>
  <style>
    body {
      margin: 0;
      display: flex;
      justify-content: center;
      align-items: center;
      flex-direction: column;
      min-height: 100vh;
      font-family: Arial, sans-serif;
      text-align: center;
      padding: 1rem;
      box-sizing: border-box;
    }
    .container {
      padding: 2rem;
      max-width: 1500px;
      margin: 0 auto;
    }
    h1 {
      margin-bottom: 1rem;
    }
    hr {
      width: 100%;
      max-width: 800px;
      margin: 1rem 0;
      border: 0;
      border-top: 1px solid #ccc;
    }
    img {
      max-width: 800px;
      width: 100%;
      height: auto;
      margin-bottom: 1rem;
    }
    .video-section {
      display: flex;
      align-items: flex-start;
      /* gap: 1rem; */
      margin-bottom: 1rem;
    }

    .video-text {
      /* max-width: 300px;  */
      text-align: left;
    }
    .video-row {
      display: flex;
      flex-wrap: wrap;
      justify-content: center;
      gap: 1rem;
      width: 100%;
      max-width: 1000px;
      margin-bottom: 1rem;
    }
    .video-row video {
      flex: 1 1 45%;
      max-width: 45%;
    }
    .home-link {
      margin-top: 1rem;
    }
    @media (max-width: 768px) {
      .video-row video {
        max-width: 100%;
        flex-basis: 100%;
      }
    }
  </style>
</head>
<body>
<div class="container">
  <div class="home-link">
    <a href="../SupplementaryVideos.html">Home</a>
  </div>
  <hr />
  <h1>Pairwise Comparisons: Foreground Object Consistency</h1>
  <hr />

  <h3>Example 1</h3>
  <div class="video-section">
    <div class="video-row">
      <video autoplay loop muted>
        <source src="analysis_videos/fgtracker_good/fgtracker_good_0_v1.mp4" type="video/mp4" />
        Your browser does not support the video tag.
      </video>
      <video autoplay loop muted>
        <source src="analysis_videos/fgtracker_good/fgtracker_good_0_v2.mp4" type="video/mp4" />
        Your browser does not support the video tag.
      </video>
    </div>
    <div class="video-text">
      <p><b>Human:</b> <br>Video 1 <span style="color:green;">✔</span> | Video 2 <span style="color:red;">✘</span></p>
      <p><b>VB-SC:</b>
        <br>Video 1 <span style="color:red;">✘</span> | Video 2 <span style="color:green;">✔</span>
        <br>Score 1 : 0.9352 | Score 2 : 0.9358</p>
      <p><b>Tracker-FG (Ours):</b> 
        <br>Video 1 <span style="color:green;">✔</span> | Video 2 <span style="color:red;">✘</span>
        <br>Score 1 : 0.9947 | Score 2 : 0.9945</p>
    </div>
  </div>
  <p>The VBench metric (VB-SC) fails to capture the fine-grained distortions near the cat's face. In addition, VB-SC tends to favour videos with less camera motion. Our Tracker-FG captures fine-grained, long-term dependencies more effectively.</p>
  <hr />

  <h3>Example 2</h3>
  <div class="video-section">
    <div class="video-row">
      <video autoplay loop muted>
        <source src="analysis_videos/fgtracker_good/fgtracker_good_1_v1.mp4" type="video/mp4" />
        Your browser does not support the video tag.
      </video>
      <video autoplay loop muted>
        <source src="analysis_videos/fgtracker_good/fgtracker_good_1_v2.mp4" type="video/mp4" />
        Your browser does not support the video tag.
      </video>
    </div>
    <div class="video-text">
      <p><b>Human:</b> <br>Video 1 <span style="color:green;">✔</span> | Video 2 <span style="color:red;">✘</span></p>
      <p><b>VB-SC:</b>
        <br>Video 1 <span style="color:red;">✘</span> | Video 2 <span style="color:green;">✔</span>
        <br>Score 1 : 0.9442 | Score 2 : 0.9567</p>
      <p><b>Tracker-FG (Ours):</b> 
        <br>Video 1 <span style="color:green;">✔</span> | Video 2 <span style="color:red;">✘</span>
        <br>Score 1 : 0.9947 | Score 2 : 0.9926</p>
    </div>
  </div>
  <p>The VBench metric (VB-SC) fails when there are multiple subjects in the scene, likely because it collapses every subject instance into a single feature. Tracker-FG tracks each subject individually and then computes consistency.</p>
  <hr />

  <h3>Example 3</h3>
  <div class="video-section">
    <div class="video-row">
      <video autoplay loop muted>
        <source src="analysis_videos/fgtracker_good/fgtracker_good_2_v1.mp4" type="video/mp4" />
        Your browser does not support the video tag.
      </video>
      <video autoplay loop muted>
        <source src="analysis_videos/fgtracker_good/fgtracker_good_2_v2.mp4" type="video/mp4" />
        Your browser does not support the video tag.
      </video>
    </div>
    <div class="video-text">
      <p><b>Human:</b> <br>Video 1 <span style="color:green;">✔</span> | Video 2 <span style="color:red;">✘</span></p>
      <p><b>VB-SC:</b>
        <br>Video 1 <span style="color:red;">✘</span> | Video 2 <span style="color:green;">✔</span>
        <br>Score 1 : 0.5502 | Score 2 : 0.9379</p>
      <p><b>Tracker-FG (Ours):</b> 
        <br>Video 1 <span style="color:green;">✔</span> | Video 2 <span style="color:red;">✘</span>
        <br>Score 1 : 0.9812 | Score 2 : 0.9746</p>
    </div>
  </div>
  <p>The VBench metric (VB-SC) is highly sensitive to camera motion, as is evident from its score for Video 1 (0.55, compared with values close to 0.95).</p>
  <hr />

  <h3>Example 4</h3>
  <div class="video-section">
    <div class="video-row">
      <video autoplay loop muted>
        <source src="analysis_videos/fgtracker_good/fgtracker_good_3_v1.mp4" type="video/mp4" />
        Your browser does not support the video tag.
      </video>
      <video autoplay loop muted>
        <source src="analysis_videos/fgtracker_good/fgtracker_good_3_v2.mp4" type="video/mp4" />
        Your browser does not support the video tag.
      </video>
    </div>
    <div class="video-text">
      <p><b>Human:</b> <br>Video 1 <span style="color:green;">✔</span> | Video 2 <span style="color:red;">✘</span></p>
      <p><b>VB-SC:</b>
        <br>Video 1 <span style="color:red;">✘</span> | Video 2 <span style="color:green;">✔</span>
        <br>Score 1 : 0.7120 | Score 2 : 0.8132</p>
      <p><b>Tracker-FG (Ours):</b> 
        <br>Video 1 <span style="color:green;">✔</span> | Video 2 <span style="color:red;">✘</span>
        <br>Score 1 : 0.9879 | Score 2 : 0.9513</p>
    </div>
  </div>
  <p>The bias that camera motion introduces into the VBench metric (VB-SC) is evident in its scores, whereas Tracker-FG focuses purely on object consistency.</p>
  <hr />

  <h3>Cases where primary objects are not generated from the same prompt</h3>
  <h3>Example 5</h3>
  <div class="video-section">
    <div class="video-row">
      <video autoplay loop muted>
        <source src="analysis_videos/fgtracker_good_dino/fgtracker_good_dino_2_v1.mp4" type="video/mp4" />
        Your browser does not support the video tag.
      </video>
      <video autoplay loop muted>
        <source src="analysis_videos/fgtracker_good_dino/fgtracker_good_dino_2_v2.mp4" type="video/mp4" />
        Your browser does not support the video tag.
      </video>
    </div>
    <div class="video-text">
      <p><b>Human:</b> <br>Video 1 <span style="color:green;">✔</span> | Video 2 <span style="color:red;">✘</span></p>
      <p><b>VB-SC:</b>
        <br>Video 1 <span style="color:red;">✘</span> | Video 2 <span style="color:green;">✔</span>
        <br>Score 1 : 0.8174 | Score 2 : 0.9828</p>
      <p><b>Tracker-FG (Ours):</b> 
        <br>Video 1 <span style="color:green;">✔</span> | Video 2 <span style="color:red;">✘</span></p>
    </div>
  </div>
  <p>The model that produced Video 2 fails to generate the primary object, so we consider this video to be low quality. Because object detection fails on this scene, we use the object detection result itself as the pairwise metric.</p>
  <hr />


  <div class="home-link">
    <a href="../SupplementaryVideos.html">Home</a>
  </div>
</div>
</body>
</html>
