Exploring an efficient frequency-guidance transformer for single image deraining

Tianyu Song; Shumin Fan; Jiyu Jin; Guiyue Jin; Lei Fan

Exploring an efficient frequency-guidance transformer for single image deraining

Tianyu Song, Shumin Fan, Jiyu Jin, Guiyue Jin, Lei Fan

Published: 01 Jan 2024, Last Modified: 05 Mar 2025Signal Image Video Process. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: In this paper, we propose an Efficient Frequency-Guided image deraining transformer, called former, to explore the more useful self-attention values from the frequency domain for better image deraining. Inspired by the traditional convolution theorem, we design frequency domain guidance attention to learn rich global and local dependencies. Firstly, we employ affine coupling to increase receptive fields implicitly, enabling the capturing of multi-scale spatial feature representations, and then they are transferred to the frequency domain using the Fourier transform. Instead of using vanilla attention, we adopt element-wise product to model global frequency information for better feature aggregation and reducing spatial complexity. As traditional feed-forward networks struggle with frequency information, we introduce an adaptive frequency collaborative block to adaptively learn frequency information and integrate local spatial information for improved image restoration. Moreover, a scale feature enhancement block is designed to exchange and aggregate information at different scales for learning mixed features of various scales. Extensive experimental results on commonly used benchmark datasets demonstrate that our method outperforms competitive methods in terms of performance.

Loading