Dual-modality learning and transformer-based approach for high-quality vector font generation

Yu Liu, Fatimah Khalid, Mas Rina Mustaffa, Azreen bin Azman

Published: 2024, Last Modified: 07 Nov 2024Expert Syst. Appl. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Highlights•A bimodal learning strategy is proposed to generate vector images from glyph images.•Alignment of word image modalities and sequence modalities mapped to discrete space.•The ideas of Sliding Window attention and RevNet are used in Transformer.•Vector images are generated from raster images by cross-modal model distillation.•Complex vector font synthesis is achieved with important application value.