Unsupervised Image to Sequence Translation with Canvas-Drawer Networks

Kevin Frans; Chin-Yi Cheng

Unsupervised Image to Sequence Translation with Canvas-Drawer Networks

Kevin Frans, Chin-Yi Cheng

27 Sept 2018 (modified: 08 Jun 2025)ICLR 2019 Conference Blind SubmissionReaders: Everyone

Abstract: Encoding images as a series of high-level constructs, such as brush strokes or discrete shapes, can often be key to both human and machine understanding. In many cases, however, data is only available in pixel form. We present a method for generating images directly in a high-level domain (e.g. brush strokes), without the need for real pairwise data. Specifically, we train a ”canvas” network to imitate the mapping of high-level constructs to pixels, followed by a high-level ”drawing” network which is optimized through this mapping towards solving a desired image recreation or translation task. We successfully discover sequential vector representations of symbols, large sketches, and 3D objects, utilizing only pixel data. We display applications of our method in image segmentation, and present several ablation studies comparing various configurations.

Keywords: image, translation, unsupervised, model-based

TL;DR: Recreate images as interpretable high-level sequences without the need for paired data.

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/unsupervised-image-to-sequence-translation/code)

13 Replies

Loading