Taking Advantage of Out-of-Corpus Information for Citation Network ClusteringDownload PDF

26 Apr 2024 (modified: 08 May 2013)ICML 2013 PeerReview submissionReaders: Everyone
Decision: oral
Abstract: In this paper we explore the use of several popular clustering and graph partitioning algorithms as a method of generating clusters of related scientific documents and suggest a simple graph augmentation technique for taking advantage of external information. We show that by hallucinating nodes for scientific documents that are cited but not present in the original data set, we can improve performance of clustering algorithms.
0 Replies

Loading