A study of multi-task learning using a VoVNet-OSA block enhanced U-Net on the Med++ MNIST dataset

SYUAN-HAO LI

Published: 26 Oct 2024, Last Modified: 04 Mar 2025IET ICETA 2025EveryoneCC BY 4.0

Abstract: In this study, we propose a multi-task learning framework using a modified VoVNet-OSA Block Enhanced UNet, named VovUnet_Var, for image segmentation and classification on the Med++ MNIST dataset. VovUnet_Var features downsampling (DownOsa) and upsampling (UpOsa) blocks, with a classification head. The architecture sequentially downscales the input image to capture hierarchical features and uses adaptive average pooling and a fully connected layer for classification. For segmentation, upsampling layers restore spatial dimensions, producing segmentation masks. This model effectively handles complex medical imaging tasks, providing a robust solution for simultaneous image segmentation and classification.