Dual Path Cross-Scale Attention Network For Image Inpainting

Published: 01 Jan 2022, Last Modified: 17 Apr 2025ICIP 2022EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Existing deep inpainting methods often generate contents with blurry textures and distorted structures, which manifests to well fuse the high-level structures and the low-level textures remains challenging. Meanwhile, current inpainting networks just process features at the encoder side but neglect the strong generative abilities in decoder pathway. To address above challenges, we propose a novel Dual Path Cross-Scale Attention Network (DPCSAN). It adopts U-Net architecture as backbone. Based on contextual attention, a Cross-Scale Attention Module (CSAM) is designed to transfer high-level structure information to adjacent low-level features across scales. CSAMs are equipped between each encoder layer and the symmetric decoder layer at multiple levels. The purpose of this design is to fill holes in shallower feature maps and to refine features generated by upsampling (with higher resolution). Experiments on Facade and CelebA-HQ demonstrate that our model performs better than state-of-the-art methods, while generating richer details and keeping original structures.
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview