FAC-FACodec: Controllable Zero-Shot Foreign Accent Conversion with Factorized Speech Codec

Published: 28 Apr 2026, Last Modified: 28 Apr 2026MSLD 2026 PosterEveryoneRevisionsCC BY 4.0
Keywords: accent conversion, voice conversion, diffusion models, pronunciation modification
TL;DR: We present the first foreign accent conversion framework with an explicit control knob that lets users trade off stronger pronunciation conversion against speaker identity preservation.
Abstract: Previous accent conversion (AC) methods, including foreign accent conversion (FAC), lack explicit control over the degree of modification. Because accent modification can alter the perceived speaker identity, balancing conversion strength and identity preservation is crucial. We present an AC framework that provides an explicit, user‑controllable parameter to adjust the strength of pronunciation-level accent modification. Results show performance comparable to recent AC systems, stronger preservation of speaker identity, and unique support for controllable accent conversion.
Email Sharing: We authorize the sharing of all author emails with Program Chairs.
Data Release: We authorize the release of our submission and author names to the public in the event of acceptance.
Submission Number: 35
Loading