KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution

15 Sept 2025 (modified: 17 Nov 2025) · ICLR 2026 Conference Withdrawn Submission · CC BY 4.0
Keywords: diffusion model, audio driven animation, lip-sync
TL;DR: KeySync generates high-resolution, realistic lip-synced videos by preventing expression "leakage" from the original clip and handling facial occlusions
Abstract: Lip synchronization, the task of aligning lip movements in an existing video with new input audio, is typically framed as a simpler variant of audio-driven facial animation. However, beyond the usual issues in talking head generation (e.g., temporal consistency), lip synchronization presents significant new challenges such as expression leakage from the input video and facial occlusions, which can severely impact real-world applications like automated dubbing but are largely neglected by existing works. To address these shortcomings, we present KeySync, a two-stage framework that resolves the issue of temporal consistency while also mitigating leakage and occlusions through a carefully designed masking strategy. We show that KeySync achieves state-of-the-art results in lip reconstruction and cross-synchronization, improving visual quality and reducing expression leakage according to LipLeak, our novel leakage metric. Furthermore, we demonstrate the effectiveness of our new masking approach in handling occlusions and validate our architectural choices through several ablation studies. Our code and models will be made publicly available.
Supplementary Material: zip
Primary Area: generative models
Submission Number: 6091