<!DOCTYPE html>
<html lang="en">
  <head>
    <meta charset="utf-8">
    <meta name="viewport" content="width=device-width, initial-scale=1, shrink-to-fit=no">
    <!-- The above 3 meta tags *must* come first in the head; any other head content must come *after* these tags -->
    <meta name="description" content="">
    <meta name="author" content="">

    <title>SpotLight: Shadow-Guided Object Relighting via Diffusion &mdash; Supplementary Material</title>

    <!-- Bootstrap core CSS -->
    <link href="./css/bootstrap.min.css" rel="stylesheet">
    <link href="./css/lightbox.min.css" rel="stylesheet">

    <script src="https://polyfill.io/v3/polyfill.min.js?features=es6"></script>
    <script id="MathJax-script" async src="https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-mml-chtml.js"></script>

    <!-- Custom styles for this template -->
    <link href="./css/sticky-footer-navbar.css" rel="stylesheet">
    <style>
      /* Center and constrain the table */
      #results-container {
        display: flex;
        flex-wrap: wrap;
        justify-content: center; /* Center align items horizontally */
        max-width: 100%; /* Constrain overall width */
        margin: 0 auto; /* Center align the container itself */
      }
  
      .result-cell {
        margin: 0px; /* Space between cells */
        overflow: hidden;
        border: 1px solid #ddd;
        text-align: center;
        padding: 0;
      }
  
      .img-container {
        position: relative;
        width: 100%;
      }
  
      .result-cell img {
        width: 100%;
        height: auto;
        position: absolute;
        top: 0;
        left: 0;
      }
  
      .method-label {
        text-align: center;
        margin-top: 5px;
      }
  
      .row.mb-3 {
        width: 100%;
        justify-content: center; /* Align rows to center */
      }

      figurecaption {
        text-align: center;
        display: block;
      }
    </style>
  </head>
  <body>

    <!-- Begin page content -->
    <div class="container">
      <div class="mt-1">
        <h1>SpotLight: Shadow-Guided Object Relighting via Diffusion &mdash; Supplementary Material</h1>
      </div>
      <p align="justify" class="lead">
        We present additional results complementing the main paper. In particular, we show interactive relighting results,
        using pre-computed images generated by our method and baselines.
        <br><br>
        <span class="font-weight-bold">Hover your mouse over the images to see animated relighting results!</span>
      </p>

      <div style="clear:both; height:2em;"></div>
      <h3>Table of Contents</h3>
      <ol>
        <li>
          <a href="#technical_details">Technical details on the shadow blending (extends sec. 4.2)</a>
        </li>
        <li>
          <a href="#rgb_x_backbone">RGB-X backbone (extends sec. 5.2)</a>
        </li>
        <li>
          <a href="#envmap_methods">Lighting-conditioned methods parametrization (extends sec. 5.3.2)</a>
        </li>
        <li>
          <a href="#additional_results">Additional quantitative results (extends sec. 5.4, tab. 1 and tab. 2)</a>
        </li>
        <li>
          <a href="#qual_results_control">Additional qualitative results (extends sec. 5.4, fig. 3) <b>(Interactive)</b></a>
        </li>
        <li>
          <a href="#user_study">User studies (extends sec. 5.5)</a>
        </li>
        <li>
          <a href="#parameter_control">Parameters control (extends sec. 5.6) <b>(Interactive)</b></a>
        </li>
        <li>
          <a href="#virtual_light_source">Light source radius (extends sec. 5.6) <b>(Interactive)</b></a>
        </li>
        <li>
          <a href="#outdoor">Outdoor scenes (extends sec. 6)</a>
        </li>
        <li>
          <a href="#exotic_objects">Diverse objects (extends sec. 6)</a>
        </li>
        <li>
          <a href="#multiple_lights">Multiple lights (extends sec. 6) <b>(Interactive)</b></a>
        </li>
        <li>
          <a href="#bg_relight">Full image relighting (extends sec. 6) <b>(Interactive)</b></a>
        </li>

      </ol>

      <a class="anchor" name="technical_details"></a>
      <h2>1. Technical details on the shadow blending</h2>
      <p>
        Here we further detail how we obtain the latent shadow mask \(\mathbf{m}_{\text{shw},\downarrow}\) from the full resolution guiding shadow \(\mathbf{m}_{\text{shw}}\).
        We first downsample the shadow \(\mathbf{m}_{\text{shw}}\)
        by simple bilinear interpolation.
        We then binarize the downsampled shadow using a threshold of 0.05 (to include the softer parts of the shadow in the mask).
        We dilate the mask using a \(3\times 3\) kernel to include the details at the edge, and further multiply by two the mask value at the edge.
        Finally, we remove from the shadow mask its intersection with the downsampled object mask, to avoid the shadow from leaking in the object shading, in the latent space.
      </p>
      <br>
      <br>
      <a class="anchor" name="rgb_x_backbone"></a>
      <h2>2. RGB-X backbone</h2>
      <p>
        We employ the checkpoint of their X->RGB model finetuned for inpainting rectangular masks, and mask out the bounding
        box
        including both the object and shadow region.
        To estimate intrinsic properties of test images (normals, metallic, roughness, and albedo), we use the RGB->X model.
        We
        observe that results are generally overly bright, likely due to a domain gap between the training and evaluation
        images.
        To address this, we re-expose the background by a factor of 2 before feeding it to the network, then divide the output
        by 2, in linear space. We apply the same background preservation strategy as in ZeroComp.
      </p>
      <p>We show quantitative and qualitative results of the RGB-X backbone in sections 4 and 5 of this supplementary material.</p>
      <br>
      <br>
      <a class="anchor" name="envmap_methods"></a>
      <h2>3. Lighting-conditioned methods parametrization</h2>
      <p>
        Both DiLightNet and Neural Gaffer require an environment map for lighting. We construct such an environment map with
        radiance \(L_{\text{env}}\) along direction \(\omega\) defined as the sum of a spherical gaussian and a constant term
        \begin{equation}
        L_{\text{env}}(\omega) = \mathbf{c}_\text{light} e^{\lambda (\omega \cdot \mathbf{v} - 1)} + \mathbf{c}_\text{amb} \,,
        \end{equation}
        where \(\mathbf{c}_\text{light}\) and \(\mathbf{c}_\text{amb}\) are the RGB colors of the light and ambient terms
        resp.,
        \(\mathbf{v}\) is the dominant light source direction, and \(\lambda\) is the bandwidth. \(\mathbf{c}_\text{amb}\) is
        obtained by computing the average color over the background image. Panoramas from the Laval Indoor HDR dataset
        (excluding those in our test set) are used to estimate a single average intensity of the dominant light source, \(k\),
        which is set to the ratio of the integral of the brightest pixels in the panoramas, divided by the integral over all
        pixels. We further divide this integral by 2 to consider a single hemisphere and avoid overly bright dominant light
        sources. The light color is defined by \(\mathbf{c}_\text{light} = k \mathbf{c}_\text{amb}'\), where
        \(\mathbf{c}_\text{amb}'\) is the normalized ambient color \(\mathbf{c}_\text{amb}\). We found that the bandwidth
        parameter \(\lambda\) did not advantage any specific method and therefore fixed it to \(\lambda = 300\), which we
        typically observe for an indoor light.</p>
      <br>
      <br>
      <a class="anchor" name="additional_results"></a>
      <h2>4. Additional quantitative results</h2>
      <p>
        We extend the quantitative results from tab. 1. First, we compute metrics separately for the foreground
        (evaluating the fidelity of the object's shading compared to the ground truth) and the background (evaluating the
        fidelity of the background shadows compared to the ground truth).
        Then, we include additional results, including SpotLight applied to the RGB-X backbone.
        </p>
        <table border="1" class="dataframe table table-striped table-bordered">
          <thead>
            <tr>
              <th>Method</th>
              <th colspan="5" halign="left">Full image</th>
              <th colspan="3" halign="left">Foreground only</th>
              <th colspan="2" halign="left">Background only</th>
            </tr>
            <tr>
              <th></th>
              <th>PSNR</th>
              <th>SSIM</th>
              <th>RMSE</th>
              <th>MAE</th>
              <th>LPIPS</th>
              <th>RMSE</th>
              <th>MAE</th>
              <th>SI-RMSE</th>
              <th>RMSE</th>
              <th>MAE</th>
            </tr>
          </thead>
          <tbody>
            <tr>
              <td>DiLightNet</td>
              <td>24.67</td>
              <td>0.948</td>
              <td>0.064</td>
              <td>0.022</td>
              <td>0.042</td>
              <td>0.207</td>
              <td>0.192</td>
              <td>0.055</td>
              <td>0.03</td>
              <td>0.009</td>
            </tr>
            <tr>
              <td>Neural Gaffer</td>
              <td>28.44</td>
              <td>0.963</td>
              <td>0.042</td>
              <td>0.015</td>
              <td>0.038</td>
              <td>0.102</td>
              <td>0.088</td>
              <td>0.049</td>
              <td>0.03</td>
              <td>0.009</td>
            </tr>
            <tr>
              <td>IC-Light</td>
              <td>26.87</td>
              <td>0.959</td>
              <td>0.054</td>
              <td>0.019</td>
              <td>0.04</td>
              <td>0.153</td>
              <td>0.13</td>
              <td>0.062</td>
              <td>0.03</td>
              <td>0.009</td>
            </tr>
            <tr>
              <td>ZeroComp+SDEdit</td>
              <td>26.0</td>
              <td>0.938</td>
              <td>0.053</td>
              <td>0.025</td>
              <td>0.079</td>
              <td><b>0.079</b></td>
              <td><b>0.064</b></td>
              <td>0.048</td>
              <td>0.048</td>
              <td>0.022</td>
            </tr>
            <tr>
              <td>SpotLight (no guidance)</td>
              <td><b>31.69</b></td>
              <td><b>0.976</b></td>
              <td><b>0.029</b></td>
              <td><b>0.011</b></td>
              <td><b>0.029</b></td>
              <td>0.086</td>
              <td>0.073</td>
              <td><b>0.046</b></td>
              <td><b>0.017</b></td>
              <td><b>0.006</b></td>
          </tr>
          <tr>
            <td>SpotLight (with guidance, ours)</td>
            <td>30.68</td>
            <td>0.974</td>
            <td>0.033</td>
            <td>0.012</td>
            <td>0.03</td>
            <td>0.1</td>
            <td>0.085</td>
            <td>0.05</td>
            <td>0.018</td>
            <td><b>0.006</b></td>
            </tr>
            <tr>
              <td>SpotLight (RGB-X backbone)</td>
              <td>26.56</td>
              <td>0.955</td>
              <td>0.051</td>
              <td>0.018</td>
              <td>0.042</td>
              <td>0.176</td>
              <td>0.155</td>
              <td>0.06</td>
              <td>0.02</td>
              <td>0.007</td>
            </tr>
            </tbody>
      </table>
      
      <p>
        We also extend the results from tab. 2, showing the results of the full image, foreground only, and background only
        for the latent mask weight β and guidance scale γ in the following two tables. We observe that some changes in
        parameters, like using no blending (β=0), or using no guidance (γ=1) may provide better quantitative results. However,
        we observe in our qualitative evaluation and user studies that these changes diminish the level of light control over
        the object. Our selected parameter
        combination provides good quantitative performance and adequate lighting control.
      </p>
      <table border="1" class="dataframe table table-striped table-bordered">
        <thead>
          <tr>
            <th>Method</th>
            <th colspan="5" halign="left">Full image</th>
            <th colspan="3" halign="left">Foreground only</th>
            <th colspan="2" halign="left">Background only</th>
          </tr>
          <tr>
            <th></th>
            <th>PSNR</th>
            <th>SSIM</th>
            <th>RMSE</th>
            <th>MAE</th>
            <th>LPIPS</th>
            <th>RMSE</th>
            <th>MAE</th>
            <th>SI-RMSE</th>
            <th>RMSE</th>
            <th>MAE</th>
          </tr>
        </thead>
        <tbody>
          <tr>
            <td>SpotLight (γ=1, no guidance)</td>
            <td>31.69</td>
            <td>0.976</td>
            <td>0.029</td>
            <td>0.011</td>
            <td>0.029</td>
            <td>0.086</td>
            <td>0.073</td>
            <td>0.046</td>
            <td>0.017</td>
            <td>0.006</td>
          </tr>
          <td>SpotLight (γ=3, with guidance, ours)</td>
          <td>30.68</td>
          <td>0.974</td>
          <td>0.033</td>
          <td>0.012</td>
          <td>0.03</td>
          <td>0.1</td>
          <td>0.085</td>
          <td>0.05</td>
          <td>0.018</td>
          <td>0.006</td>
          </tr>
          <tr>
            <td>SpotLight (γ=7)</td>
            <td>28.68</td>
            <td>0.966</td>
            <td>0.043</td>
            <td>0.015</td>
            <td>0.036</td>
            <td>0.138</td>
            <td>0.116</td>
            <td>0.062</td>
            <td>0.019</td>
            <td>0.006</td>
          </tr>
        </tbody>
        </table>
        
        
        <table border="1" class="dataframe table table-striped table-bordered">
          <thead>
            <tr>
              <th>Method</th>
              <th colspan="5" halign="left">Full image</th>
              <th colspan="3" halign="left">Foreground only</th>
              <th colspan="2" halign="left">Background only</th>
            </tr>
            <tr>
              <th></th>
              <th>PSNR</th>
              <th>SSIM</th>
              <th>RMSE</th>
              <th>MAE</th>
              <th>LPIPS</th>
              <th>RMSE</th>
              <th>MAE</th>
              <th>SI-RMSE</th>
              <th>RMSE</th>
              <th>MAE</th>
            </tr>
          </thead>
          <tbody>
            <tr>
              <td>SpotLight (β=0.2)</td>
              <td>29.24</td>
              <td>0.969</td>
              <td>0.039</td>
              <td>0.014</td>
              <td>0.034</td>
              <td>0.102</td>
              <td>0.086</td>
              <td>0.051</td>
              <td>0.025</td>
              <td>0.008</td>
            </tr>
            <td>SpotLight (β=0.05, ours)</td>
            <td>30.68</td>
            <td>0.974</td>
            <td>0.033</td>
            <td>0.012</td>
            <td>0.03</td>
            <td>0.1</td>
            <td>0.085</td>
            <td>0.05</td>
            <td>0.018</td>
            <td>0.006</td>
            </tr>
            <tr>
              <td>SpotLight (β=0, no blending)</td>
              <td>30.81</td>
              <td>0.974</td>
              <td>0.032</td>
              <td>0.012</td>
              <td>0.029</td>
              <td>0.098</td>
              <td>0.083</td>
              <td>0.05</td>
              <td>0.018</td>
              <td>0.006</td>
            </tr>
          </tbody>
        </table>
      <a class="anchor" name="qual_results_control"></a>
      <br>
      <br>
      <h2>5. Additional qualitative results</h2>
      <p align="justify">We present 20 additional randomly-selected results on the "user-controlled" dataset, extending the
        qualitative results from fig. 5.
        In this case, we rendered 8 light directions instead
        of 5 (as used for the user study), in order to also show results where the shadow is behind the object.
        <br>
        <strong>Move your mouse from left to right over the images to see the light direction change.</strong>
      </p>
      
      <div id="results-container-fig-5-ext" class="row"></div>
      <br>
      <br>
      <a class="anchor" name="user_study"></a>
      <h2>6. User studies</h2>
      <h3>6.1 Additional user studies on object shading and shadows only</h3>
        <p>
          In addition to the overall realism and lighting control user studies, we conducted two user studies to disentangle
          the shadow from the shading realism.
          To evaluate shading realism, the shadow needs to be fixed for all methods. To do so, for all methods, we replace the
          region outside the object mask by the output of SpotLight, containing a refined shadow.
          To evaluate shadow realism, we do the opposite. We replace the region within the object mask by the output of
          SpotLight, containing the shaded object.
          We show the results of these two user studies below.
        <div class="container">
          <div class="row">
            <div class="col-md-6">
              <!-- Use <figure> and <figcaption> for semantics, if desired -->
              <figure class="figure">
                <img src="figures/user_study/thurstone_User Study (shading only).svg" class="figure-img img-fluid w-100"
                  alt="Shading realism user study results">
                <figcaption class="text-center">
                  Shading realism user study results (N=13)
                </figcaption>
              </figure>
            </div>
            <div class="col-md-6">
              <figure class="figure">
                <img src="figures/user_study/thurstone_User Study (shadows only).svg" class="figure-img img-fluid w-100"
                  alt="Shadow realism user study results">
                <figcaption class="text-center">
                  Shadow realism user study results (N=11)
                </figcaption>
              </figure>
            </div>
          </div>
        </div>
        We observe that ZeroComp+SDEdit achieves similar results to SpotLight in terms of shading realism, when the user is
        not asked about the controllability of the lighting. However,
        SpotLight generates more realistic shading than Neural Gaffer with statistical significance, even if Neural Graffer is
        trained to shade objects correctly.
        <br>
        On the task of refining the guiding shadow into a realistic one, SpotLight achieves higher results than both
        ZeroComp+SDEdit and using the guiding shadow directly.
        We attribute this to our shadow blending strategy, which acts as a soft constraint over the shadow generation at each
        denoising timestep.
        <br><br>
        <h3>6.2 Additionnal lighting control study</h3>
        <p>In addition to the lighting control user study in fig. 4b, that compares our method to the ZeroComp+SDEdit,
          we show here that disabling the guidance term yields a similar degradation in perceptual scores for the lighting
          control setting.
        </p>
        <div class="container">
          <div class="row">
            <div class="col-md-6">
              <!-- Use <figure> and <figcaption> for semantics, if desired -->
              <figure class="figure">
                <img src="figures/user_study/thurstone_User Study (control, shading only) vs SDEdit.svg"
                  class="figure-img img-fluid w-100">
                <figcaption class="text-center">
                  Lighting control user study against ZeroComp+SDEdit (N=8)
                </figcaption>
              </figure>
            </div>
            <div class="col-md-6">
              <figure class="figure">
                <img src="figures/user_study/thurstone_User Study (control, shading only) vs CFG=1.svg"
                  class="figure-img img-fluid w-100">
                <figcaption class="text-center">
                  Shadow realism user study against no guidance (N=7)
                </figcaption>
              </figure>
            </div>
          </div>
        </div>
        <h3>6.3 Further details</h3>
        <h4>User study randomization</h4>
        <p>
          To use the Thurstone case V Law of comparative judgement, we need a fixed set of comparisons, shown to all
          observers.
          This set of comparison is randomly sampled and reused for all users.
          To limit bias, the method left/right ordering is randomized, and the order of comparisons was randomized for each
          observer.
          All user studies were conducted using different sets of observers, to avoid bias.
        </p>
      
        <h4>Observer filtering</h4>
        <p>
          For each study, three sentinels images were randomly placed in the questions. Incorrectly selecting one of the
          sentinel answers led to exclusion.
          In the realism user studies, the sentinels corresponded to the object set to be full white, therefore having highly
          unrealistic shading.
          For the light control user studies, the sentinels corresponded to an object where the object's lighting doesn't
          change as the shadow moves.
        </p>
        <h4>Per-study details</h4>
        <p>Here, we show the details of each user study. "Filtered observers" represents the number of observers that ended up
          contributing to the scores, which were selected since they didn't click the sentinels.
          Also note that the number of questions shown includes the three sentinel questions asked to the users.
        </p>
        <table border="1" class="dataframe table table-striped table-bordered">
          <thead>
            <tr>
              <th>User study name</th>
              <th>Original observers</th>
              <th>Filtered observers (N)</th>
              <th>Questions per method pairs</th>
              <th>Total questions</th>
            </tr>
          </thead>
          <tbody>
            <tr>
              <td>Overall realism</td>
              <td>40</td>
              <td>35</td>
              <td>20</td>
              <td>123</td>
            </tr>
            <tr>
              <td>Shadow realism</td>
              <td>14</td>
              <td>13</td>
              <td>40</td>
              <td>123</td>
            </tr>
            <tr>
              <td>Shading realism</td>
              <td>14</td>
              <td>11</td>
              <td>40</td>
              <td>123</td>
            </tr>
            <tr>
              <td>Lighting control (vs. ZeroComp+SDEdit)</td>
              <td>10</td>
              <td>8</td>
              <td>40</td>
              <td>43</td>
            </tr>
            <tr>
              <td>Lighting control (vs. no guidance)</td>
              <td>10</td>
              <td>7</td>
              <td>40</td>
              <td>43</td>
            </tr>
          </tbody>
        </table>
        <h3>6.4 User study interface</h3>
        <h4>Realism study interface</h4>
        <p>
          Below, we show the instructions page of the overall realism user study and a question from it.
        </p>
      
        <div class="row">
          <div class="col-md-12">
            <a href="figures/user_study/user_study_realism_instructions.png" target="_blank">
              <img src="figures/user_study/user_study_realism_instructions.png" class="border border-secondary"
                alt="User study instructions" style="width: 100%;">
            </a>
            <figurecaption>Overall realism user study instructions</figurecaption>
          </div>
        </div>
        <br>
        <br>
        <div class="row">
          <div class="col-md-12">
            <a href="figures/user_study/user_study_realism_question.png" target="_blank">
              <img src="figures/user_study/user_study_realism_question.png" class="border border-secondary"
                alt="User study question" style="width: 100%;">
            </a>
            <figurecaption>Overall realism user study question</figurecaption>
          </div>
        </div>
        <br>
        <br>
        <h4>Lighting control study interface</h4>
        <p>
          And here, we show a video demonstration of the lighting control user study.
        </p>
        <div class="col-12">
          <figure class="figure w-100">
            <video class="figure-img img-fluid w-100" controls>
              <source src="figures/user_study/control_user_study_vs_cfg_1.mp4" type="video/mp4">
              Your browser does not support the video tag.
            </video>
            <figcaption class="text-center">
              Lighting control user study demonstration
            </figcaption>
          </figure>
        </div>
        </p>
        <br>
        <br>
        <a class="anchor" name="parameter_control"></a>
        <h2>7. Additional results for parameters control</h2>
        <p align="justify">
          We show how the two main parameters of our method can be adjusted for enhanced artistic control, extending sec. 4.2
          and
          sec. 4.3.
          Here, both parameters are modified separately, but they could be modified simultaneously for optimal results.
        </p>
        <h3>7.1. Guidance scale γ</h3>
        <p align="justify">Here, we show control over the guidance scale γ, extending fig. 6.
          Increasing the guidance scale generally makes the dominant light source stronger.
          <br>
          <strong>Move the mouse vertically to adjust the guidance scale</strong>, and horizontally to adjust the light
          azimuth
          angle.
        </p>
        <p>The current light azimuth angle is <strong id="cfg-azimuth-value">0°</strong>.</p>
        <p align="justify">The current guidance scale is <strong id="cfg-gamma-value">γ=3.0</strong>.</p>
        <div id="results-container-cfg" class="row"></div>
      
        <h3>7.2. Latent mask weight β</h3>
        <p align="justify">Here, we show control over the latent mask weight β, extending fig. 7.
          Increasing the latent mask weight makes the dominant shadow darker.
          <br>
          <strong>Move the mouse vertically to adjust the latent mask weight</strong>, and horizontally to adjust the light
          azimuth angle.
        </p>
        <p>The current light azimuth angle is <strong id="latent-weight-azimuth-value">0°</strong>.</p>
        <p align="justify">The current latent weight is <strong id="latent-weight-value">β=0.05</strong>.</p>
        <div id="results-container-latent-weight" class="row"></div>
        <br>
        <br>
        <a class="anchor" name="virtual_light_source"></a>
        <h2>8. Light source radius</h2>
      
        <p align="justify">We adjust the light source radius used to generate the coarse shadow fed to SpotLight.
          Small radii lead to hard shadows, whereas higher radii generate more diffuse shadows.
          <br>
          <strong>Move the mouse vertically to adjust the light source radius</strong>, and horizontally to adjust the light
          azimuth angle.
      
        </p>
        <p>The current light azimuth angle is <strong id="light-size-azimuth-value">0°</strong>.</p>
        <p align="justify">The current light radius is <strong id="light-size-value">1 (default)</strong>.</p>
        <div id="results-container-light-size" class="row"></div>
      
        <a class="anchor" name="outdoor"></a>
        <br>
        <br>
        <h2>9. Outdoor scenes</h2>
        <p align="justify">
          Even though SpotLight relies on a diffusion renderer trained exclusively on indoor data,
          it can generalize to outdoor scenes well.
        </p>
        <div class="row">

          <div class="col p-1">
            <a href="./figures/outdoor/cardboardbox/frame0.jpg" target="_blank">
              <img src="./figures/outdoor/cardboardbox/frame0.jpg" alt="Background" style="width:100%">
            </a>
          </div>
          <div class="col p-1">
            <a href="./figures/outdoor/concrete/frame0.jpg" target="_blank">
              <img src="./figures/outdoor/concrete/frame0.jpg" alt="Background" style="width:100%">
            </a>
          </div>
          <div class="col p-1">
            <a href="./figures/outdoor/cat2/frame0.jpg" target="_blank">
              <img src="./figures/outdoor/cat2/frame0.jpg" alt="Background" style="width:100%">
            </a>
          </div>
          <div class="col p-1">
            <a href="./figures/outdoor/vase/frame0.jpg" target="_blank">
              <img src="./figures/outdoor/vase/frame0.jpg" alt="Background" style="width:100%">
            </a>
          </div>
        </div>
        <br>

        <a class="anchor" name="exotic_objects"></a>
        <br>
        <br>
        <h2>10. Diverse objects</h2>
        <p align="justify">
          SpotLight is not restricted to furniture and can handle many kinds of virtual objects.
        </p>
          
        <div class="row">

          <div class="col p-1">
            <a href="./figures/exotic_objects/teapot/frame0.jpg" target="_blank">
              <img src="./figures/exotic_objects/teapot/frame0.jpg" alt="Background" style="width:100%">
            </a>
          </div>
          <div class="col p-1">
            <a href="./figures/exotic_objects/cow/frame0.jpg" target="_blank">
              <img src="./figures/exotic_objects/cow/frame0.jpg" alt="Background" style="width:100%">
            </a>
          </div>
          <div class="col p-1">
            <a href="./figures/exotic_objects/fox/0012.jpg" target="_blank">
              <img src="./figures/exotic_objects/fox/0012.jpg" alt="Background" style="width:100%">
            </a>
          </div>
          <div class="col p-1">
            <a href="./figures/exotic_objects/boot/0012.jpg" target="_blank">
              <img src="./figures/exotic_objects/boot/0012.jpg" alt="Background" style="width:100%">
            </a>
          </div>
        </div>
        <br>

        <!--
        <div class="row">

          <div class="col p-1">
            <a href="./figures/soft_hard/soft/0006.jpg" target="_blank">
              <img src="./figures/soft_hard/soft/0006.jpg" alt="Background" style="width:100%">
            </a>
          </div>
          <div class="col p-1">
            <a href="./figures/soft_hard/hard/0006.jpg" target="_blank">
              <img src="./figures/soft_hard/hard/0006.jpg" alt="Background" style="width:100%">
            </a>
          </div>
          <div class="col p-1">
            <a href="./figures/non_planar_shadows/9C4A0307-ca23f47d11_00_crop_B075X5HB1D/0007.jpg" target="_blank">
              <img src="./figures/non_planar_shadows/9C4A0307-ca23f47d11_00_crop_B075X5HB1D/0007.jpg" alt="Background" style="width:100%">
            </a>
          </div>
          <div class="col p-1">
            <a href="./figures/non_planar_shadows/9C4A2551-3436caaf51_00_crop_B082QB3D8Y/0004.jpg" target="_blank">
              <img src="./figures/non_planar_shadows/9C4A2551-3436caaf51_00_crop_B082QB3D8Y/0004.jpg" alt="Background" style="width:100%">
            </a>
          </div>
        </div>
        <br>
        <br>
        
-->

        <a class="anchor" name="multiple_lights"></a>
        <br>
        <br>
        <h2>11. Multiple lights</h2>
        <p>In our paper, we analyze all the methods by conditioning on a single dominant light source, which is sufficiently
          realistic in most cases.
          Here, we show that we can combine outputs from SpotLight at different light directions to simulate multiple light
          sources.
          We combine a static light direction (shadow to the right of the object), with a dynamic direction (hover over the
          images to move this virtual light). We combine the two lightings
          in linear space (by using the gamma of 2.2) using the following equation:
          $$x_{\text{combined}}=(0.5 \times {x_{\text{light 1}}}^{2.2} + 0.5 \times {x_{\text{light
          2}}}^{2.2})^{\frac{1}{2.2}}.$$
          Notice how the static shadow and shading have a clear effect on the combined output. <br><br>
        <div id="results-container-two-lights" class="row"></div>
        <br>
        <br>
        <a class="anchor" name="bg_relight"></a>
        <h2>12. Full image relighting</h2>
        <p align="justify">In our work, we use SpotLight for <i>object</i> relighting. We experiment with extending SpotLight
          for full-scene relighting using the following approach.
        <ol>
          <li>We run SpotLight as-is to render the object lit with a desired shadow. We obtain the <strong>SpotLight</strong>
            output.</li>
          <li>We run the backbone (in this case ZeroComp), with the inverted shading mask, feeding it only the shading of the
            object and its shadow
            (obtained by dividing the SpotLight prediction with the albedo map)
            and letting the backbone shade the background.
            We obtain the <strong>relit background</strong> output.</li>
          <li>We repeat step 2 for each of the 8 light directions, and average them, giving us the <strong>average relit
              background</strong>.</li>
          <li>We find that the background relighting in step 2 and 3 poorly reconstruct the background's identity. We
            therefore
            obtain a per-pixel highlight map, by dividing
            the relit background prediction with the average relit background. We multiply that highlight map with the
            original
            background and obtain a more accurate output, <strong>relit background + composition</strong>.
          </li>
        </ol>
        We observe in the following example that background shadows and highlights are added in a consistent manner with
        respect
        to the object's shadow.
        </p>
      
        <div id="results-container-bg-relight" class="row"></div>
        <h6>Note about the checkpoint used for background relighting</h6>
        <p align="justify">
          In step 1, we use the same ZeroComp checkpoint as in all other experiments. For steps 2 and 3, in order to properly
          relight the background, we found that this checkpoint
          had limited full-scene relighting capabilities.
          We hypothesize that this is due to the fact that the neural renderer backbone is trained to relight only a small
          region of an image (circular and rectangular masks at training time).
          We therefore train a separate model from scratch for 270K iterations on inverted shading masks, where we take the
          inverse of the circular and rectangular masks for shading masking. Furthermore,
          we only keep the largest connected component in the mask. This leaves training examples where only a small region of
          the shading is known and the full background lighting needs to be inferred.
        </p>
        <br>
        <br>

    <footer class="footer fixed-bottom" style="left: 0">
      <div class="container">
        <span class="text-muted">SpotLight: Shadow-Guided Object Relighting via Diffusion &mdash; Supplementary Material</span>
      </div>
    </footer>
  </body>
  <script src="./js/custom/additional_qual.js"></script>
  <script src="./js/custom/custom_exp.js"></script>
  <script>
    
    makeResultsTable({
        resultsContainer: document.getElementById('results-container-fig-5-ext'),
        methods: [
        'ic_light_bg_guidance_ambiant_eevee_denoised',
        'dilightnet_ambiant_eevee_denoised',
        'neural_gaffer_erode5_ambiant_eevee_denoised',
        'zerocomp_coarse_shadow_sd_edit_0.5_post_color_balanced_denoised',
        'rgbx_cfg_3_intensity_2_inpaint_iccv_post_color_balanced_denoised', 
        'shadow_comp_v3_kornia_3x3_edge_2_post_color_balanced_denoised'
        ],
        cropNames: ["9C4A8088-1bb62eeb83_07_crop_B07B4MRLNC", "AG8A6719-fc8e1ea686_05_crop_B072Y5MZQH", "9C4A0400-e71f3506f8_00_crop_B07QC8CGKK", "9C4A0395-ef28332472_09_crop_B07YPLQ7KW", "AG8A1932-eb0d101d0e_07_crop_B00BBDF500", "9C4A3419-2e5c9b4d85_00_crop_B07B4MG2MD", "9C4A4668-16193e7517_03_crop_B072FVHS4V", "AG8A4106-248c9b6695_01_crop_B07G2WWZC5", "9C4A6765-55315335bf_09_crop_B076YFLYFR", "9C4A0400-e71f3506f8_00_crop_B07B4CZMNQ", "9C4A0566-a088c98ccf_00_crop_B07HZ6ZCW7", "9C4A0132-07352d1dd0_08_crop_B07PHP31FB", "9C4A9487-24b5ce36e4_01_crop_B07B4MDGDF", "AG8A4106-248c9b6695_00_crop_B07HZ1LZS1", "AG8A8563-e652dd5751_02_crop_B07QHYDDS1", "9C4A0132-07352d1dd0_07_crop_B07DBF3WMC", "9C4A6864-533a74ab58_04_crop_B076VF4KRN", "9C4A5673-d4ecde512a_00_crop_B07HPTBB7P", "AG8A8710-4a14a72a31_05_crop_B07B4LZQC9", "9C4A0400-e71f3506f8_00_crop_B082QBBY7R"],
        methodsMapping: {
        'shadow_comp_v3_kornia_3x3_edge_2_post_color_balanced_denoised': "SpotLight (ZeroComp backbone)",
        'rgbx_cfg_3_intensity_2_inpaint_iccv_post_color_balanced_denoised': 'SpotLight (RGB-X backbone)',
        'ic_light_bg_guidance_ambiant_eevee_denoised': 'IC-Light',
        'dilightnet_ambiant_eevee_denoised': 'DiLightNet',
        'neural_gaffer_erode5_ambiant_eevee_denoised': 'Neural Gaffer',
        'zerocomp_coarse_shadow_sd_edit_0.5_post_color_balanced_denoised': 'ZeroComp+SDEdit',
        'stick-renders': 'Light direction',
        'simulated_gt': 'Ground truth',
        },
        folderName: 'figures/ext_fig_qual',
    });

    makeResultsTable2({
        resultsContainer: document.getElementById('results-container-cfg'),
        azimuthValueText: document.getElementById('cfg-azimuth-value'),
        parameterValueText: document.getElementById('cfg-gamma-value'),
        methods: [
        'shadow_comp_v3_cfg_1_post_color_balanced_denoised',
        'shadow_comp_v3_kornia_3x3_edge_2_post_color_balanced_denoised',
        'shadow_comp_v3_cfg_7_post_color_balanced_denoised',
        ],
        cropNames: ["AG8A6551-16ae8050c9_06_crop_B075X4QFK2", "9C4A8239-7a9d7093e2_04_crop_B082VLJ7Y2", "9C4A0566-a088c98ccf_05_crop_B07VKM698C", "9C4A3419-2e5c9b4d85_08_crop_B07B4D8B2S", "9C4A4878-4dac321d8b_02_crop_B07HP93VDJ", "9C4A4861-19626f89e9_04_crop_B07B4MSYPS", "AG8A0630-e5622e17d2_01_crop_B07B85FJD5", "AG8A6719-fc8e1ea686_05_crop_B07DBJ1H18", "AG8A3196-8e1dfd8d95_08_crop_B07WMQ8P8Q", "AG8A8843-b91f89fff2_05_crop_B07B4YXNR3"],
        methodsMapping: {
        'shadow_comp_v3_cfg_1_post_color_balanced_denoised': 'γ=1.0 (no guidance)',
        'shadow_comp_v3_kornia_3x3_edge_2_post_color_balanced_denoised': 'γ=3.0 (default)',
        'shadow_comp_v3_cfg_7_post_color_balanced_denoised': 'γ=7.0',
        },
        folderName: 'figures/ext_cfg',
    });

    makeResultsTable2({
        resultsContainer: document.getElementById('results-container-latent-weight'),
        azimuthValueText: document.getElementById('latent-weight-azimuth-value'),
        parameterValueText: document.getElementById('latent-weight-value'),
        methods: [
        'shadow_comp_v3_latent_weight_0_post_color_balanced_denoised',
        'shadow_comp_v3_kornia_3x3_edge_2_post_color_balanced_denoised',
        'shadow_comp_v3_latent_weight_0.2_post_color_balanced_denoised',
        ],
        cropNames: ["AG8A6551-16ae8050c9_06_crop_B075X4QFK2", "9C4A8239-7a9d7093e2_04_crop_B082VLJ7Y2", "9C4A0566-a088c98ccf_05_crop_B07VKM698C", "9C4A3419-2e5c9b4d85_08_crop_B07B4D8B2S", "9C4A4878-4dac321d8b_02_crop_B07HP93VDJ", "9C4A4861-19626f89e9_04_crop_B07B4MSYPS", "AG8A0630-e5622e17d2_01_crop_B07B85FJD5", "AG8A6719-fc8e1ea686_05_crop_B07DBJ1H18", "AG8A3196-8e1dfd8d95_08_crop_B07WMQ8P8Q", "AG8A8843-b91f89fff2_05_crop_B07B4YXNR3"],
        methodsMapping: {
        'shadow_comp_v3_latent_weight_0_post_color_balanced_denoised': 'β=0 (no shadow blending)',
        'shadow_comp_v3_kornia_3x3_edge_2_post_color_balanced_denoised': 'β=0.05 (default)',
        'shadow_comp_v3_latent_weight_0.2_post_color_balanced_denoised': 'β=0.2',
        },
        folderName: 'figures/ext_latent_weight',
    });

    makeResultsTable2({
        resultsContainer: document.getElementById('results-container-light-size'),
        azimuthValueText: document.getElementById('light-size-azimuth-value'),
        parameterValueText: document.getElementById('light-size-value'),
        methods: [
        'shadow_comp_v3_light_radius_0_post_color_balanced_denoised',
        'shadow_comp_v3_kornia_3x3_edge_2_post_color_balanced_denoised',
        'shadow_comp_v3_light_radius_5_post_color_balanced_denoised',
        ],
        cropNames: ["AG8A6551-16ae8050c9_06_crop_B075X4QFK2", "9C4A8239-7a9d7093e2_04_crop_B082VLJ7Y2", "9C4A0566-a088c98ccf_05_crop_B07VKM698C", "9C4A3419-2e5c9b4d85_08_crop_B07B4D8B2S", "9C4A4878-4dac321d8b_02_crop_B07HP93VDJ", "9C4A4861-19626f89e9_04_crop_B07B4MSYPS", "AG8A0630-e5622e17d2_01_crop_B07B85FJD5", "AG8A6719-fc8e1ea686_05_crop_B07DBJ1H18", "AG8A3196-8e1dfd8d95_08_crop_B07WMQ8P8Q", "AG8A8843-b91f89fff2_05_crop_B07B4YXNR3"],
        methodsMapping: {
        'shadow_comp_v3_light_radius_0_post_color_balanced_denoised': '0',
        'shadow_comp_v3_kornia_3x3_edge_2_post_color_balanced_denoised': '1 (default)',
        'shadow_comp_v3_light_radius_5_post_color_balanced_denoised': '5',
        },
        folderName: 'figures/light_size',
    });
    
    //TODO
    makeResultsTable({
        resultsContainer: document.getElementById('results-container-two-lights'),
        methods: [
          'two_lights_shadow_comp_v2_default_post_color_balanced_static_denoised',
          'two_lights_shadow_comp_v2_default_post_color_balanced_denoised',
          'two_lights_shadow_comp_v2_default_full_composite',
        ],
        cropNames: ["AG8A0630-e5622e17d2_02_crop_B0717B4RRW", "AG8A8563-e652dd5751_02_crop_B07B78RCT5", "AG8A6551-16ae8050c9_06_crop_B075X4QFK2", "AG8A4106-248c9b6695_00_crop_B075X33SC6"],
        methodsMapping: {
          'two_lights_shadow_comp_v2_default_post_color_balanced_static_denoised': '$$x_{\\text{light 1}}$$',
          'two_lights_shadow_comp_v2_default_post_color_balanced_denoised': '$$x_{\\text{light 2}}$$',
          'two_lights_shadow_comp_v2_default_full_composite': '$$x_{\\text{combined}}$$',
        },
        folderName: 'figures/two_lights',
        cropSubfolder: false,
        imageSize: 320
    });

    makeResultsTable({
        resultsContainer: document.getElementById('results-container-bg-relight'),
        methods: [
          'shadow_comp_v2_default',
          'relight_shadow_comp_v2_default_default',
          'relight_shadow_comp_v2_default_avg',
          'relight_shadow_comp_v2_default_composed'
        ],
        cropNames: ["9C4A0132-07352d1dd0_01_crop_B07JVNBJTB", "AG8A6551-16ae8050c9_06_crop_B075X4QFK2", "9C4A6765-55315335bf_07_crop_B07DBDN79B", "9C4A2563-d4a99744c5_00_crop_B07DBGWFGY"],
        methodsMapping: {
          'shadow_comp_v2_default' : 'SpotLight',
          'relight_shadow_comp_v2_default_default' : 'Relit background',
          'relight_shadow_comp_v2_default_avg' : 'Average relit background',
          'relight_shadow_comp_v2_default_composed': 'Relit background + composition'
        },
        folderName: 'figures/bg_relight',
        cropSubfolder: false,
        imageSize: 250
    });
  </script>
    <script src="./js/lightbox-plus-jquery.min.js"></script>
    <script src="./js/bootstrap.min.js"></script>
</html>