High-Precision for Multi-Task Learning from In-Vehicle Camera using BiFPN

Chenyu Zhang; Hidenori Itaya; Tsubasa Hirakawa; Takayoshi Yamashita; Hironobu Fujiyoshi

High-Precision for Multi-Task Learning from In-Vehicle Camera using BiFPN

Chenyu Zhang, Hidenori Itaya, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi

Published: 01 Jan 2024, Last Modified: 13 Nov 2024IV 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Multi-task learning is effective for object detection and segmentation, which are closely related to each other and necessary for automated driving. However, there is a problem with the learning process in conventional multi-task learning models. In multi-task learning, common features among downstream tasks are first extracted by a backbone network. Then, these features are used for different downstream tasks. Since the required feature is different depending on the downstream task, it is necessary to extract features suitable for each downstream task. In this paper, we propose a multi-tasking model that introduces BiFPN feature fusion method for automated driving tasks and the Next-ViT model utilizing CNN and Transformer to extract features. From the evaluation experiments of automated driving tasks, we confirmed that the proposed method improves the accuracy of multi-task learning.

Loading