Conditional Hallucinations for Image Compression

Till Aczel; Roger Wattenhofer

Conditional Hallucinations for Image Compression

Till Aczel, Roger Wattenhofer

Published: 09 Oct 2024, Last Modified: 19 Nov 2024Compression Workshop @ NeurIPS 2024EveryoneRevisionsBibTeXCC BY 4.0

Keywords: Learned image compression, compression, alignment

Abstract: In lossy image compression, models face the challenge of either hallucinating details or generating out-of-distribution samples due to the information bottleneck. This implies that at times, introducing hallucinations is necessary to generate in-distribution samples. The optimal level of hallucination varies depending on image content, as humans are sensitive to small changes that alter the semantic meaning. We propose a novel compression method that dynamically balances the degree of hallucination based on content. We collect data and train a model to predict user preferences on hallucinations. By using this prediction to adjust the perceptual weight in the reconstruction loss, we develop a \textbf{Con}ditionally \textbf{Ha}llucinating compression model (\textbf{ConHa}) that outperforms state-of-the-art image compression methods. Code and images are available at \href{https://polybox.ethz.ch/index.php/s/owS1k5JYs4KD4TA}{https://polybox.ethz.ch/index.php/s/owS1k5JYs4KD4TA}.

Submission Number: 27

Loading