MISL: Multi-grained image-text semantic learning for text-guided image inpainting

Xingcai Wu, Kejun Zhao, Qianding Huang, Qi Wang, Zhenguo Yang, Gefei Hao

Published: 2024, Last Modified: 18 Jun 2024Pattern Recognit. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Highlights•Proposed a hierarchical learning method for text-guided image inpainting.•The object-fine-grained learning stage focuses on the visual semantics of objects of interest.•Designed a mask reconstruction module focusing on the object of interest.•Explored a multi-attention mechanism to fuse visual and textual semantics.•Devised a flexible discriminator to penalize the corrupted area.