Intrinsic LoRA: A Generalist Approach for Discovering Knowledge in Generative Models

Xiaodan Du; Nicholas Kolkin; Greg Shakhnarovich; Anand Bhattad

Intrinsic LoRA: A Generalist Approach for Discovering Knowledge in Generative Models

Xiaodan Du, Nicholas Kolkin, Greg Shakhnarovich, Anand Bhattad

Published: 09 Apr 2024, Last Modified: 11 Apr 2024SynData4CVEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Visual knowledge, Generative models, Efficient fine-tuning

TL;DR: We introduce Intrinsic LoRA (I-LoRA), a general approach that uses Low-Rank Adaptation (LoRA) to discover scene intrinsics such as normals, depth, albedo, and shading from a wide array of generative models.

Abstract: Generative models have been shown to be capable of creating images that closely mimic real scenes, suggesting they inherently encode scene representations. We introduce Intrinsic LoRA (I-LoRA), a general approach that uses Low-Rank Adaptation (LoRA) to discover scene intrinsics such as normals, depth, albedo, and shading from a wide array of generative models. I-LoRA is lightweight, adding minimally to the model's parameters and requiring very small datasets for this knowledge discovery. Our approach, applicable to Diffusion models, GANs, and Autoregressive models alike, generates intrinsics using the same output head as the original images.

Submission Number: 6

Loading