FontTransformer: Few-shot high-resolution Chinese glyph image synthesis via stacked transformers

Published: 01 Jan 2023, Last Modified: 15 Dec 2024 | Pattern Recognit. 2023 | CC BY-SA 4.0
Abstract: Highlights
• We propose FontTransformer, a novel few-shot Chinese font synthesis model that uses stacked transformers to synthesize high-resolution (e.g., 256×256 or 1024×1024) glyph images. To the best of our knowledge, this is the first work that effectively applies Transformers to the task of few-shot Chinese font synthesis.
• We design a novel chunked glyph image encoding scheme to encode glyph images into token sequences. With this encoding scheme, our method can synthesize arbitrarily high-resolution glyph images while keeping the length of the token sequence constant.
• Extensive experiments demonstrate that our method is capable of synthesizing high-quality glyph images in the target font style from a few input samples, outperforming the state of the art both quantitatively and qualitatively.
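The abstract does not spell out the chunked encoding, so the following is only a minimal sketch of one way a glyph image could be mapped to a token sequence whose length does not depend on resolution. It assumes a fixed grid of chunks (here 16×16, an illustrative choice) where each chunk becomes one token and only the per-token dimension grows with image size. The function name `chunk_glyph` and the `grid` parameter are hypothetical and not taken from the paper.

```python
import numpy as np


def chunk_glyph(image: np.ndarray, grid: int = 16) -> np.ndarray:
    """Split a square grayscale glyph image into grid*grid chunks.

    Hypothetical illustration: each chunk is flattened into one token, so the
    sequence length is always grid*grid regardless of the image resolution.
    """
    height, width = image.shape
    if height != width or height % grid != 0:
        raise ValueError("expected a square image whose side is divisible by grid")
    side = height // grid  # pixel side length of a single chunk
    # (grid, side, grid, side) -> (grid, grid, side, side) -> (tokens, pixels per token)
    chunks = image.reshape(grid, side, grid, side).transpose(0, 2, 1, 3)
    return chunks.reshape(grid * grid, side * side)


low = chunk_glyph(np.zeros((256, 256), dtype=np.float32))
high = chunk_glyph(np.zeros((1024, 1024), dtype=np.float32))
print(low.shape, high.shape)  # (256, 256) (256, 4096)
```

Under these assumptions both resolutions give 256 tokens; only the size of each token changes, which is the property the second highlight describes. How the actual model embeds or quantizes each chunk is not stated in the abstract and is left out here.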