Terminology/Keyphrase Extraction for Creation of Book Indexes in PolishOpen Website

2021 (modified: 08 Feb 2022)TPDL 2021Readers: Everyone
Abstract: The paper addresses the problem of automatic identification of phrases to be included in back-of-book indexes. We analyzed books in Polish and English published with subject indexes compiled by their authors. We checked what kinds of phrases are placed in those indexes and how often they actually occur in the corresponding books. In the experiments, we use existing terminology and keyphrase extraction tools. For Polish, the first tool is better than the second one, but for English texts, the results are inconclusive.
0 Replies

Loading