TFUT: Task fusion upward transformer model for multi-task learning on dense prediction

Published: 01 Jan 2024, Last Modified: 13 Jun 2025Comput. Vis. Image Underst. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•Propose a Transformer based multi-task model on dense prediction.•Propose an asymmetric attention based task interaction method with task guidance.•Design high-quality and low-cost upsampling method to avoid image detail loss.•Incorporate CNN into Transformer to model both local objects and global spatial relationships simultaneously.•Achieve optimal multi-task performance on public datasets NYUD-v2 and PASCAL Context.
Loading