Research history generation using maximum margin clustering of research papers based on metainformationOpen Website

Published: 01 Jan 2011, Last Modified: 16 Oct 2023iiWAS 2011Readers: Everyone
Abstract: Our research aim is the automatic generation of a researcher's research history from research articles published on the internet. Research history generation based on the k-Means clustering algorithm has been proposed in previous work. However, the performance of the k-Means algorithm is unsatisfactory. We propose a method based on Maximum Margin Clustering (MMC). MMC is a new clustering algorithm based on Support Vector Machines (SVM). It is known that MMC is better than existing clustering algorithms such as k-Means. In this paper, we describe how to convert articles into vectors using metainformation about them and how to decide an initial setting for MMC automatically. We demonstrate by experiment that the purity of a method based on MMC is about 0.58 and its entropy is about 0.415. This result is better than that achieved in previous work (purity: 0.35, entropy: 0.47).
0 Replies

Loading