Modernized Training of U-Net for Aerial Semantic Segmentation

Published: 01 Jan 2024, Last Modified: 26 Feb 2025WACV (Workshops) 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: In this paper, we propose an improved training protocol of U-Net architecture for the semantic segmentation of aerial images. We test our approach on the challenging FLAIR #2 dataset. We present an extensive ablation study on the influence of different approach components on the overall performance. The ablation study includes a comparison of different model backbones, image augmentations, learning rate schedulers, loss functions, and training procedures. We additionally propose a two-stage training procedure and evaluate different options for the model ensemble. Based on the results we design the final setup of the model training protocol. This final setup decreases the relative error by approximately 18% and achieves mIoU equal to 0.641, which is a new state-of-the-art result. Our code is available at: h ttps://github.com/strakaj/U-Net-for-remote-sensing.
Loading