Automatic Text Summarization Based on Transportation Network and Word Mover's Distances Embeddings: A Toy Experiment

Published: 01 Jan 2024, Last Modified: 04 Feb 2025MICAI (2) 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: This paper presents CafeTAL, a new Automatic Text Summarization algorithm by extraction. CafeTAL is based on a Transportation Network model combined with Word Mover’s Distance using Word Embeddings. The document employed in the reported experiments is a French language document composed of two different topics, including polysemous words. In order to get significant statistics, our evaluation protocol is based on the ROUGE metric using a vast number (245) of human judges. Our results are very encouraging.
Loading