Meta-Learning via Classifier(-free) Diffusion Guidance

Elvis Nava; Seijin Kobayashi; Yifei Yin; Robert K. Katzschmann; Benjamin F Grewe

Meta-Learning via Classifier(-free) Diffusion Guidance

Elvis Nava, Seijin Kobayashi, Yifei Yin, Robert K. Katzschmann, Benjamin F Grewe

Published: 14 Aug 2023, Last Modified: 17 Sept 2024Accepted by TMLREveryoneRevisionsBibTeXCC BY 4.0

Abstract: We introduce meta-learning algorithms that perform zero-shot weight-space adaptation of neural network models to unseen tasks. Our methods repurpose the popular generative image synthesis techniques of natural language guidance and diffusion models to generate neural network weights adapted for tasks. We first train an unconditional generative hypernetwork model to produce neural network weights; then we train a second "guidance" model that, given a natural language task description, traverses the hypernetwork latent space to find high-performance task-adapted weights in a zero-shot manner. We explore two alternative approaches for latent space guidance: "HyperCLIP"-based classifier guidance and a conditional Hypernetwork Latent Diffusion Model ("HyperLDM"), which we show to benefit from the classifier-free guidance technique common in image generation. Finally, we demonstrate that our approaches outperform existing multi-task and meta-learning methods in a series of zero-shot learning experiments on our Meta-VQA dataset.

Submission Length: Regular submission (no more than 12 pages of main content)

Video: https://www.youtube.com/watch?v=O6lB2RaBh2k

Code: https://github.com/elvisnava/hyperclip

Supplementary Material: zip

Assigned Action Editor: ~Vincent_Dumoulin1

License: Creative Commons Attribution 4.0 International (CC BY 4.0)

Submission Number: 1139

Loading