Abstract: Highlights
• We propose a multimodal incongruity adjustment strategy.
• We pull consistent embeddings closer together and push inconsistent embeddings apart.
• We capture the inter- and intra-layer topology both statically and dynamically.
• Extensive experiments demonstrate the superiority of our approach.