Joint Semantic Segmentation using representations of LiDAR point clouds and camera images

Published: 01 Jan 2024, Last Modified: 08 Apr 2025Inf. Fusion 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We revisit the key factor of LiDAR-camera fusion, namely the soft joint mechanism.•We develop an attention-based multimodal fusion in point cloud segmentation.•We build multi-scale pairwise inputs and interact using the dual-stream transformer.•We propose unimodal data augmentation and cross-modal contrastive learning.
Loading