Abstract: Highlights•We propose a knowledge fusion network for multimodal sarcasm detection.•Commonsense knowledge is incorporated in our model.•The cross-modal semantic similarity detection modules are designed.•The model achieves better performance than strong baselines on sarcasm detection.
Loading