Multiresolution Textual Inversion

27 Sept 2022, 20:35 (modified: 29 Nov 2022, 14:44)
SBM 2022 Oral
Readers: Everyone
Keywords: textual inversion, diffusion, personalized generation, text-to-image
TL;DR: We extend Textual Inversion to learn pseudo-words that represent a concept at different resolutions.
Abstract: We extend Textual Inversion to learn pseudo-words that represent a concept at different resolutions. This allows us to generate images that use the concept at different resolutions and also to manipulate different resolutions using language. Once the pseudo-words are learned, the user can generate images that agree with the original concept at different levels of detail: the prompt "A photo of $S^*(0)$" produces the exact object, while the prompt "A photo of $S^*(0.8)$" matches only the rough outlines and colors. Our framework allows us to generate images that use different resolutions of an image (e.g., details, textures, styles) as separate pseudo-words that can be composed in various ways.
Student Paper: Yes