<!DOCTYPE html>
<html lang="en">
<head>
    <title>DC-VideoGen</title>
<style>
    .tooltip {
        position: relative;
        display: inline-block;
        cursor: default;
    }

    .tooltip .tooltiptext {
        visibility: hidden;
        padding: 0.25em 0.5em;
        background-color: black;
        color: #fff;
        text-align: center;
        border-radius: 0.25em;
        white-space: nowrap;

        /* Position the tooltip */
        position: absolute;
        z-index: 1;
        top: 70%;
        left: 50%;
        transform: translateX(-50%);
        transition-property: visibility;
        transition-delay: 0s;
    }

    .tooltip:hover .tooltiptext {
        visibility: visible;
        transition-delay: 0.3s;
    }
</style>
</head>
<body>
    <h1 style="text-align: center; font-size: 50px;">DC-VideoGen: Efficient Video Generation with<br>Deep Compression Video Autoencoder</h1>

    <h1 style="text-align: center; color:#A31F34;">High Resolution Videos Generated by Our DC-VideoGen-Wan</h1>
    <div style="display: flex; width: 100%; gap: 0.5%;">
        <div style="width: 18.05%;">
            <div class="tooltip" style="text-align: center; width: 100%; margin-bottom: 0.5%;">
                <video controls muted autoplay loop width="100%">
                    <source src="teaser/i2v_720/1.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <p class="tooltiptext">A corgi perched on a branch,<br>tensed and ready to leap to the ground.</p>
            </div>
            <div class="tooltip" style="text-align: center; width: 100%; margin-bottom: 0.5%;">
                <video controls muted autoplay loop width="100%">
                    <source src="teaser/i2v_720/2.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <p class="tooltiptext">An astronaut and a knight<br>embrace in a desolate landscape.</p>
            </div>
            <div class="tooltip" style="text-align: center; width: 100%;">
                <video controls muted autoplay loop width="100%">
                    <source src="teaser/i2v_720/3.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <p class="tooltiptext">An astronaut rides a horse<br>across the moon towards Earth.</p>
            </div>
            <p style="font-size: 20px; text-align: center;">
                <b>I2V 720P</b>
            </p>
        </div>
        <div style="width: 27.5%;">
            <div class="tooltip" style="text-align: center; width: 100%; margin-bottom: 0.5%;">
                <video controls muted autoplay loop width="100%">
                    <source src="teaser/i2v_1080/1.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <p class="tooltiptext">A cinematic video of the text 'DC-VideoGen' formed by clouds.</p>
            </div>
            <div class="tooltip" style="text-align: center; width: 100%;">
                <video controls muted autoplay loop width="100%">
                    <source src="teaser/i2v_1080/2.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <p class="tooltiptext">A poised blonde woman gracefully sips tea from a delicate cup.</p>
                
            </div>
            <p style="font-size: 20px; text-align: center;">
                <b>I2V 1080P</b>
            </p>
        </div>
        <div style="width: 55.25%;">
            <div class="tooltip" style="text-align: center; width: 100%;">
                <video controls muted autoplay loop width="100%">
                    <source src="teaser/t2v_2160/1.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <p class="tooltiptext">A time-lapse video captures a flower's delicate and detailed bloom.</p>
            </div>
            <p style="font-size: 20px; text-align: center;">
                <b>T2V 2160P</b>
            </p>
        </div>
    </div>
    

    <h1 style="text-align: center; color:#A31F34;">Video Autoencoder Reconstruction Visualization</h1>
    <div style="display: flex; gap: 0.5%;">
        <div style="text-align: center; width: 20%;">
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/input_1.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/input_2.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/input_3.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <p style="font-size: 18px;">
                Input<br>
                Shape: 80x256x256
            </p>
        </div>
        <div style="text-align: center; width: 20%;">
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/ltxvae_1.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/ltxvae_2.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/ltxvae_3.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <p style="font-size: 18px;">
                LTX Video VAE (<b>Causal</b>)<br>
                Configuration: f32t8c128<br>
                Compression Ratio: 192<br>
                PSNR: 31.12
            </p>
        </div>
        <div style="text-align: center; width: 20%;">
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/opensora2ae_no_tiling_1.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/opensora2ae_no_tiling_2.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/opensora2ae_no_tiling_3.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <p style="font-size: 18px;">
                Video DC-AE (<b>Non-Causal</b>) w/o tiling<br>
                Configuration: f32t4c128<br>
                Compression Ratio: 96<br>
                PSNR: 31.52
            </p>
        </div>
        <div style="text-align: center; width: 20%;">
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/opensora2ae_1.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/opensora2ae_2.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/opensora2ae_3.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <p style="font-size: 18px;">
                Video DC-AE (<b>Non-Causal</b>) w/ tiling<br>
                Configuration: f32t4c128<br>
                Compression Ratio: 96<br>
                PSNR: 33.65
            </p>
        </div>
        <div style="text-align: center; width: 20%;">
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/dcaev_f32t4c64_1.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/dcaev_f32t4c64_2.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <video controls muted autoplay loop width="100%" style="margin-bottom: 1%;">
                <source src="temporal_modeling/dcaev_f32t4c64_3.mp4" type="video/mp4">
                Your browser does not support the video tag.
            </video>
            <p style="font-size: 18px;">
                DC-AE-V (<b>Chunk-Causal</b>)<br>
                Configuration: f32t4c64<br>
                Compression Ratio: 192<br>
                PSNR: 32.72
            </p>
        </div>
    </div>
    <!-- <p style="font-size: 20px;">
        We compare the reconstruction performance of three temporal modeling designs: <b>Causal</b>, <b>Non-Causal</b>, and our <b>Chunk-Causal</b>.<br>
        The <b>Causal</b> design only allows information to flow from earlier frames to later frames. This design extends well to longer videos, but its reconstruction accuracy is limited because it cannot fully exploit temporal redundancy. Compared to the <b>Causal</b> design, our <b>Chunk-Causal</b> design achieves similar reconstruction accuracy at <b>2x</b> the compression ratio.<br>
        The <b>Non-Causal</b> design allows bidirectional information flow, thus achieving better reconstruction accuracy. However, as shown in the middle frames of Video DC-AE (<b>Non-Causal</b>) w/o tiling, it cannot reconstruct videos well when directly extended to longer videos. Temporal tiling and blending can alleviate this issue, but still produce blurred reconstructions at tile boundaries.<br>
        Our <b>Chunk-Causal</b> design aims to take advantage of both designs: it allows bidirectional temporal modeling within each large chunk to fully utilize temporal redundancy, and causal information flow between chunks for better extensibility to longer videos.
    </p> -->

    <h1 style="text-align: center; color:#A31F34;">Image-to-Video (I2V) Visualization</h1>
    <div style="display: flex; width: 100%;">
        <div style="display: flex; width: 50%; text-align: center;">
            <div style="width: 50%; text-align: center;">
                <p style="font-size: 20px;">
                    <b>Wan2.1-I2V-14B</b><br>
                    (<span style="color:#5E5E5E;"><b>27.88</b></span> mins/video)
                </p>
            </div>
            <div style="width: 50%; text-align: center;">
                <p style="font-size: 20px;">
                    <b>DC-VideoGen-Wan2.1-I2V-14B</b><br>
                    (<span style="color:#A31F34;"><b>3.67</b></span> mins/video)
                </p>
            </div>
        </div>
        <div style="display: flex; width: 50%; text-align: center;">
            <div style="width: 50%; text-align: center;">
                <p style="font-size: 20px;">
                    <b>Wan2.1-I2V-14B</b><br>
                    (<span style="color:#5E5E5E;"><b>27.88</b></span> mins/video)
                </p>
            </div>
            <div style="width: 50%; text-align: center;">
                <p style="font-size: 20px;">
                    <b>DC-VideoGen-Wan2.1-I2V-14B</b><br>
                    (<span style="color:#A31F34;"><b>3.67</b></span> mins/video)
                </p>
            </div>
        </div>
    </div>
    <div style="display: flex; width: 100%; gap: 0.5%;">
        <div style="width: 50%;">
            <div style="display: flex; gap: 0.5%; width: 100%; overflow: hidden;">
                <video controls muted autoplay loop width="50%">
                    <source src="i2v_comparison/wan_1.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <video controls muted autoplay loop width="50%" style="margin-top: -1.1%">
                    <source src="i2v_comparison/dcvideogen_wan_1.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
            </div>
            <p style="font-size: 18px;">
                <b>Prompt</b>: A battle-scarred robot walks through a desolate city ruin.
            </p>
        </div>
        <div style="width: 50%;">
            <div style="display: flex; gap: 0.5%; width: 100%; overflow: hidden;">
                <video controls muted autoplay loop width="50%">
                    <source src="i2v_comparison/wan_2.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <video controls muted autoplay loop width="50%" style="margin-top: -1.1%">
                    <source src="i2v_comparison/dcvideogen_wan_2.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
            </div>
            <p style="font-size: 18px;">
                <b>Prompt</b>: A trail runner sprints through a sun-dappled forest, face set with determination.
            </p>
        </div>
    </div>
    <div style="display: flex; width: 100%; gap: 0.5%;">
        <div style="width: 50%;">
            <div style="display: flex; gap: 0.5%; width: 100%; overflow: hidden;">
                <video controls muted autoplay loop width="50%">
                    <source src="i2v_comparison/wan_3.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <video controls muted autoplay loop width="50%" style="margin-top: -1.1%">
                    <source src="i2v_comparison/dcvideogen_wan_3.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
            </div>
            <p style="font-size: 18px;">
                <b>Prompt</b>: An eaglet soars high above a vast, vibrant forest canopy.
            </p>
        </div>
        <div style="width: 50%;">
            <div style="display: flex; gap: 0.5%; width: 100%; overflow: hidden;">
                <video controls muted autoplay loop width="50%">
                    <source src="i2v_comparison/wan_4.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <video controls muted autoplay loop width="50%" style="margin-top: -1.1%">
                    <source src="i2v_comparison/dcvideogen_wan_4.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
            </div>
            <p style="font-size: 18px;">
                <b>Prompt</b>: A rugged off-road vehicle speeds through a sunlit forest track.
            </p>
        </div>
    </div>

    <h1 style="text-align: center; color:#A31F34;">Text-to-Video (T2V) Visualization</h1>
    <div style="display: flex; width: 100%;">
        <div style="display: flex; width: 50%; text-align: center;">
            <div style="width: 50%; text-align: center;">
                <p style="font-size: 20px;">
                    <b>Wan2.1-T2V-14B</b><br>
                    (<span style="color:#5E5E5E;"><b>27.52</b></span> mins/video)
                </p>
            </div>
            <div style="width: 50%; text-align: center;">
                <p style="font-size: 20px;">
                    <b>DC-VideoGen-Wan2.1-T2V-14B</b><br>
                    (<span style="color:#A31F34;"><b>3.58</b></span> mins/video)
                </p>
            </div>
        </div>
        <div style="display: flex; width: 50%; text-align: center;">
            <div style="width: 50%; text-align: center;">
                <p style="font-size: 20px;">
                    <b>Wan2.1-T2V-14B</b><br>
                    (<span style="color:#5E5E5E;"><b>27.52</b></span> mins/video)
                </p>
            </div>
            <div style="width: 50%; text-align: center;">
                <p style="font-size: 20px;">
                    <b>DC-VideoGen-Wan2.1-T2V-14B</b><br>
                    (<span style="color:#A31F34;"><b>3.58</b></span> mins/video)
                </p>
            </div>
        </div>
    </div>
    <div style="display: flex; width: 100%; gap: 0.5%;">
        <div style="width: 50%;">
            <div style="display: flex; gap: 0.5%; width: 100%; overflow: hidden;">
                <video controls muted autoplay loop width="50%">
                    <source src="t2v_comparison/wan_1.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <video controls muted autoplay loop width="50%" style="margin-top: -1.1%">
                    <source src="t2v_comparison/dcvideogen_wan_1.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
            </div>
            <p style="font-size: 18px;">
                <b>Prompt</b>: A girl on a ship's deck, clutching a letter, looks back with sad determination.
            </p>
        </div>
        <div style="width: 50%;">
            <div style="display: flex; gap: 0.5%; width: 100%; overflow: hidden;">
                <video controls muted autoplay loop width="50%">
                    <source src="t2v_comparison/wan_2.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <video controls muted autoplay loop width="50%" style="margin-top: -1.1%">
                    <source src="t2v_comparison/dcvideogen_wan_2.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
            </div>
            <p style="font-size: 18px;">
                <b>Prompt</b>: Minecraft with the most gorgeous high res 8k texture pack ever.
            </p>
        </div>
    </div>
    <div style="display: flex; width: 100%; gap: 0.5%;">
        <div style="width: 50%;">
            <div style="display: flex; gap: 0.5%; width: 100%; overflow: hidden;">
                <video controls muted autoplay loop width="50%">
                    <source src="t2v_comparison/wan_3.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <video controls muted autoplay loop width="50%" style="margin-top: -1.1%">
                    <source src="t2v_comparison/dcvideogen_wan_3.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
            </div>
            <p style="font-size: 18px;">
                <b>Prompt</b>: Three video game characters team up in a vibrant arcade.
            </p>
        </div>
        <div style="width: 50%;">
            <div style="display: flex; gap: 0.5%; width: 100%; overflow: hidden;">
                <video controls muted autoplay loop width="50%">
                    <source src="t2v_comparison/wan_4.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
                <video controls muted autoplay loop width="50%" style="margin-top: -1.1%">
                    <source src="t2v_comparison/dcvideogen_wan_4.mp4" type="video/mp4">
                    Your browser does not support the video tag.
                </video>
            </div>
            <p style="font-size: 18px;">
                <b>Prompt</b>: A man is skiing down thick layers of clouds. Towering mountain peaks are faintly visible.
            </p>
        </div>
    </div>
</body>
</html>
