Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure

Théo Gigant, Camille Guinaudeau, Frédéric Dufaux

Published: 2025, Last Modified: 02 Apr 2026CoRR 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading