Abstract: Highlights•We study Scientific Multimodal Summarization with Multimodal Output (SMSMO).•We propose a cross-modality model (with multimodal objectives) to perform SMSMO.•We construct two datasets to assess the performance of SMSMO models.
Loading