The GAN is dead; long live the GAN! A Modern GAN Baseline

Published: 25 Sept 2024, Last Modified: 06 Nov 2024 · NeurIPS 2024 poster · CC BY 4.0
Keywords: GAN
TL;DR: Makes GAN training stable with math (!), then uses this to create a simple, modern baseline architecture with SOTA-competitive performance.
Abstract: There is a widespread claim that GANs are difficult to train, and GAN architectures in the literature are littered with empirical tricks. We provide evidence against this claim and build a modern GAN baseline in a more principled manner. First, we derive a well-behaved regularized relativistic GAN loss that addresses issues of mode dropping and non-convergence that were previously tackled via a bag of ad-hoc tricks. We analyze our loss mathematically and prove that it admits local convergence guarantees, unlike most existing relativistic losses. Second, this loss allows us to discard all ad-hoc tricks and replace the outdated backbones used in common GANs with modern architectures. Using StyleGAN2 as an example, we present a roadmap of simplification and modernization that results in a new minimalist baseline, R3GAN. Despite its simplicity, our approach surpasses StyleGAN2 on FFHQ, ImageNet, CIFAR, and Stacked MNIST, and compares favorably against state-of-the-art GANs and diffusion models.
Code: https://www.github.com/brownvc/R3GAN
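The "regularized relativistic GAN loss" described in the abstract combines a relativistic pairing (RpGAN) objective with zero-centered gradient penalties on both real data (R1) and generated data (R2). The sketch below is a minimal PyTorch rendering of that combination, not the authors' reference implementation (see the linked repository for that): the function names, the `gamma` penalty weight, the softplus form of the relativistic loss, and the assumption of NCHW image batches are all illustrative choices.

```python
import torch
import torch.nn.functional as F

def zero_centered_gradient_penalty(d_out, x):
    """Squared norm of the discriminator's gradient w.r.t. its input.

    Used for both R1 (x = real samples) and R2 (x = fake samples).
    Assumes NCHW image batches, hence the sum over dims [1, 2, 3].
    """
    grad, = torch.autograd.grad(outputs=d_out.sum(), inputs=x, create_graph=True)
    return grad.square().sum(dim=[1, 2, 3]).mean()

def relativistic_d_loss(D, real, fake, gamma):
    """Relativistic pairing loss for D, plus R1 and R2 penalties (a sketch).

    D is trained to score each real sample above its paired fake sample.
    `gamma` is an illustrative penalty weight, set per dataset in practice.
    """
    real = real.detach().requires_grad_(True)  # grads needed for R1
    fake = fake.detach().requires_grad_(True)  # grads needed for R2
    d_real, d_fake = D(real), D(fake)
    adv = F.softplus(d_fake - d_real).mean()   # relativistic logistic loss
    r1 = zero_centered_gradient_penalty(d_real, real)
    r2 = zero_centered_gradient_penalty(d_fake, fake)
    return adv + (gamma / 2) * (r1 + r2)

def relativistic_g_loss(D, real, fake):
    """Relativistic generator loss: G tries to make fakes outscore paired reals."""
    return F.softplus(D(real) - D(fake)).mean()
```

Note that the R1 and R2 terms require second-order gradients (`create_graph=True`), so a discriminator step with this loss costs roughly twice a plain adversarial update.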
Primary Area: Generative models
Submission Number: 5381