Abstract: Highlights•We propose FontTransformer, a novel few-shot Chinese font synthesis model, using stacked transformers to synthesize high-resolution (e.g., 256×256 or 1024×1024) glyph images. To the best of our knowledge, this is the first work that effectively applies Transformers on the task of few-shot Chinese font synthesis.•We design a novel chunked glyph image encoding scheme to encode glyph images into token sequences. With this encoding scheme, our method can synthesize arbitrarily high-resolution glyph images by keeping the length of the token sequence a constant.•Extensive experiments have been conducted to demonstrate that our method is capable of synthesizing high-quality glyph images in the target font style from a few input samples, outperforming the state of the art both quantitatively and qualitatively.
Loading