ToFM: Topic-specific Facet Mining by Facet Propagation within ClustersDownload PDFOpen Website

Published: 01 Jan 2021, Last Modified: 28 Apr 2023ICBK 2021Readers: Everyone
Abstract: Mining the facets of topics is an essential task for information retrieval, information extraction and knowledge base construction. For the topics in courses, there are three challenges: different topics have different facet, the labels of facets rarely appear in the topic description text and not all topics have enough textural information to mine facets. In this paper we propose a weakly-supervised algorithm for topic-specific facet mining (ToFM for short) based on our finding that similar topics in a cluster have similar facet sets. For example, topics Binary Search Tree, Suffix Tree and AVL tree in Tree cluster have example, insertion, deletion, traversal and other similar facets. ToFM first splits topics in a domain into several topic clusters based on the topic description text. Then ToFM extracts initial facet sets for all topics from the corresponding Wikipedia article pages. Finally, ToFM performs a normalized facet propagation within each topic cluster to acquire final facet sets of every topic. We evaluate the performance of ToFM on six real-world datasets and experimental results show that ToFM achieves better performance than the existing facet mining algorithms.
0 Replies

Loading