<!DOCTYPE html>

<html lang="en">
<head>
<meta charset="utf-8"/>
<meta content="width=device-width, initial-scale=1.0" name="viewport"/>
<title>Why Image Tokenization Needs a Rethink</title>
<script src="https://cdn.tailwindcss.com"></script>
<link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.5.2/css/all.min.css" rel="stylesheet"/>
<style>
        body {
            min-height: 100vh;
            display: flex;
            justify-content: center;
            align-items: center;
            background: #f5f5f5;
            font-family: 'Computer Modern', 'Latin Modern', 'Times New Roman', serif;
            padding: 2rem;
            color: #000000;
            line-height: 1.4;
        }

        .slide {
            width: 1440px;
            height: 810px;
            background: #ffffff;
            border-radius: 0;
            border: 2px solid #003262;
            box-shadow: 0 4px 8px rgba(0,0,0,0.15);
            overflow: hidden;
            position: relative;
            display: flex;
            flex-direction: column;
            
            background-image:
                radial-gradient(circle at 12% 10%, rgba(0,50,98,0.05) 0, transparent 24%),
                radial-gradient(circle at 84% 18%, rgba(253,181,21,0.04) 0, transparent 22%);
            
            border-left: 4px solid #FDB515;
            border-right: 4px solid #FDB515;
        }

        .slide::before {
            content: "";
            position: absolute;
            top: 0;
            left: 0;
            width: 100%;
            height: 50px;
            background: linear-gradient(135deg, #003262 0%, #FDB515 100%);
            z-index: 0;
        }

        .slide::after {
            content: "";
            position: absolute;
            bottom: 0;
            left: 0;
            width: 100%;
            height: 30px;
            background: linear-gradient(135deg, #FDB515 0%, #003262 100%);
            z-index: 0;
        }

        .slide-header {
            position: absolute;
            top: 0;
            left: 0;
            width: 100%;
            height: 50px;
            display: flex;
            align-items: center;
            justify-content: center;
            z-index: 2;
            color: #ffffff;
            font-size: 1.7rem;
            font-weight: bold;
            text-shadow: 0 1px 3px rgba(0,0,0,0.7);
        }
        
        .content-container {
            position: relative;
            z-index: 1;
            padding: 90px 40px 50px 40px; /* 50px header + 40px gap */
            flex-grow: 1;
            display: flex;
            flex-direction: column;
            justify-content: center;
            height: 100%;
            box-sizing: border-box;
        }
        
        /* Enforce aspect ratio and boundary constraints for placeholders */
        div[data-image-id] {
            max-width: 100%;
            max-height: 100%;
            box-sizing: border-box; /* Ensures padding and border are included in the element's total width and height */
        }

        /* Custom styles for this specific slide */
        .main-content-grid {
            display: grid;
            grid-template-columns: 1fr 2px 1fr;
            gap: 50px;
            width: 100%;
            height: 100%;
            max-height: 600px;
            align-items: center;
        }
        
        .separator {
            width: 2px;
            height: 80%;
            background-color: #FDB515;
            border-radius: 1px;
            justify-self: center;
            animation: grow-height 1s ease-out forwards;
            transform-origin: center;
        }

        @keyframes grow-height {
            from { transform: scaleY(0); }
            to { transform: scaleY(1); }
        }

        .content-block {
            display: flex;
            flex-direction: column;
            gap: 1rem;
            animation: fadeIn 1s ease-out forwards;
            opacity: 0;
        }

        .left-column {
            justify-content: center;
            animation-delay: 0.2s;
        }
        
        .right-column {
            display: flex;
            flex-direction: column;
            justify-content: space-around;
            gap: 4rem;
            height: 100%;
        }

        .right-column .content-block:nth-child(1) {
            animation-delay: 0.4s;
        }

        .right-column .content-block:nth-child(2) {
            animation-delay: 0.6s;
        }

        .content-block h3 {
            font-size: 2rem;
            font-weight: bold;
            color: #003262;
            display: flex;
            align-items: center;
            gap: 1rem;
        }
        
        .content-block h3 i {
            color: #003262;
            font-size: 1.8rem;
        }

        .content-block p {
            font-size: 1.5rem;
            line-height: 1.6;
            color: #333;
            padding-left: 3.8rem; /* Aligns text with title text, past the icon */
        }
        
        .highlight-text {
            background: linear-gradient(120deg, rgba(253, 181, 21, 0.25) 0%, rgba(253, 181, 21, 0.1) 100%);
            padding: 0.1rem 0.4rem;
            border-radius: 4px;
            font-weight: 600;
        }

        @keyframes fadeIn {
            from {
                opacity: 0;
                transform: translateY(20px);
            }
            to {
                opacity: 1;
                transform: translateY(0);
            }
        }

    </style>
<script id="mathjax-config-stable" type="text/javascript">
        window.MathJax = {
            tex: {
                inlineMath: [['$','$'], ['\\(','\\)']],
                displayMath: [['$$','$$'], ['\\[','\\]']],
                processEscapes: true,
                processEnvironments: true,
                macros: {
                    'R': '\\mathbb{R}',
                    'E': '\\mathbb{E}',
                    'bm': ['\\boldsymbol{#1}',1],
                    'norm': ['\\left\\|#1\\right\\|',1]
                }
            },
            options: {
                skipHtmlTags: ['script','noscript','style','textarea']
            },
            svg: {
                scale: 1,
                mtextInheritFont: true
            },
            chtml: {
                scale: 1,
                mtextInheritFont: true
            },
            responsive: {
                enabled: true
            },
            startup: {
                ready: function () {
                    MathJax.startup.defaultReady();
                    window._mathjaxReady = true;
                }
            }
        };
        </script><script defer="" id="MathJax-script-stable" src="https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-chtml.js" type="text/javascript"></script><script type="text/javascript">
        setTimeout(function() {
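            // window.MathJax is always defined by the config object above, so test the ready flag set in startup.ready instead.
            if (!window._mathjaxReady) {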
                var s = document.createElement('script');
                s.src = 'https://cdnjs.cloudflare.com/ajax/libs/mathjax/3.2.2/es5/tex-chtml.js';
                s.defer = true;
                s.id = 'MathJax-script-fallback';
                document.head.appendChild(s);
            }
        }, 2000);
        </script><style id="equation-styles-stable">
        /* Stable equation styles */
        .equation-group {
            display: flex;
            flex-direction: column;
            align-items: center;
            margin: 12px auto;
            padding: 0;
            width: 100%;
            --eq-base-color: rgb(0, 0, 0);
            --eq-variable-color: rgb(0, 0, 0);
            --eq-operator-color: rgb(0, 0, 0);
            --eq-number-color: rgb(0, 0, 0);
            --eq-text-color: rgb(0, 0, 0);
            --eq-font-size: 18px;
        }
        
        .equation-group .equation {
            display: block;
            text-align: center;
            margin: 0;
            padding: 0;
            width: 100%;
        }
        
        .equation-group .equation .MathJax {
            display: block !important;
            margin: 0 auto !important;
            font-size: var(--eq-font-size) !important;
            color: var(--eq-base-color) !important;
        }
        
        .equation-group .variables {
            display: block;
            width: 100%;
            margin: 6px auto 0 auto;
            padding: 0;
            text-align: left;
            font-size: var(--eq-font-size);
        }
        
        .equation-group .variables ul {
            margin: 4px 0 0 0;
            padding: 0;
            list-style: none;
            display: flex;
            flex-wrap: wrap;
            justify-content: center;
            gap: 8px 16px;
        }
        
        .equation-group .variables ul li {
            margin: 0;
            white-space: nowrap;
            font-size: 14px;
        }
        </style></head>
<body>
<div class="slide">
<div class="slide-header">
<span>Why Image Tokenization Needs a Rethink</span>
</div>
<div class="content-container">
<div class="main-content-grid">
<div class="content-block left-column">
<h3>
<i class="fa-solid fa-scale-unbalanced-flip"></i>
                        The Quality vs. Compression Dilemma
                    </h3>
<p>
                        Traditional tokenizers are caught in a <span class="highlight-text">tug-of-war</span>: compressing images enough for efficient model training and storage means sacrificing image fidelity and fine detail.
                    </p>
</div>
<div class="separator"></div>
<div class="right-column">
<div class="content-block">
<h3>
<i class="fa-solid fa-microchip"></i>
                            The High Cost of High Resolution
                        </h3>
<p>
                            Generating detailed, high-resolution images remains <span class="highlight-text">computationally demanding</span>, which slows generation and limits accessibility.
                        </p>
</div>
<div class="content-block">
<h3>
<i class="fa-solid fa-puzzle-piece"></i>
                            Loss of Semantic Coherence
                        </h3>
<p>
                            Aggressive compression often strips away context, so generated outputs lose <span class="highlight-text">semantic meaning</span> and visual coherence.
                        </p>
</div>
</div>
</div>
</div>
</div>
</body>
</html>