Big Scholarly Data in CiteSeerX: Information Extraction from the WebOpen Website

2015 (modified: 12 Nov 2022)WWW (Companion Volume) 2015Readers: Everyone
Abstract: We examine CiteSeerX, an intelligent system designed with the goal of automatically acquiring and organizing large-scale collections of scholarly documents from the world wide web. From the perspective of automatic information extraction and modes of alternative search, we examine various functional aspects of this complex system with an eye towards ongoing and future research developments.
0 Replies

Loading