Abstract: Highlights•Unified representation omits the specific modality information.•Various modalities data share certain level similarity.•Modal relations re-weight multimodal specific predictive heads can facilitate generalization.•Detailed theoretical analysis and rigorous proof can support our method.
External IDs:dblp:journals/inffus/WangSWSJ26
Loading