Resource-efficient image inpainting

Dharshan Sampath Kumar; Pranav Jeevan P; Amit Sethi

Resource-efficient image inpainting

Dharshan Sampath Kumar, Pranav Jeevan P, Amit Sethi

01 Mar 2023 (modified: 01 Jun 2023)Submitted to Tiny Papers @ ICLR 2023Readers: Everyone

Keywords: Resource efficient, token mixer, computer vision, image inpainting

TL;DR: An investigation into the application of resource efficient token mixing networks (Metaformers), for generative task of image inpainting.

Abstract: Image inpainting refers to the synthesis of missing regions in an image, which can help restore occluded or degraded areas and also serve as a precursor task for self-supervision. The current state-of-the-art models for image inpainting are computationally heavy as they are based on vision transformer backbones in adversarial or diffusion settings. This paper diverges from vision transformers by using a computationally-efficient WaveMix-based fully convolutional architecture, which uses a 2D-discrete wavelet transform (DWT) for spatial and multi-resolution token-mixing along with convolutional layers. The proposed model outperforms the current-state-of-the-art models for large mask inpainting on reconstruction quality while also using less than half the parameter count and considerably lower training and evaluation times.

8 Replies

Loading