Template-independent wrapper for web forumsOpen Website

2009 (modified: 12 Nov 2022)SIGIR 2009Readers: Everyone
Abstract: This paper presents a novel work on the task of extracting data from Web forums. Millions of users contribute rich information to Web forum everyday, which has become an important resource for manyWeb applications, such as product opinion retrieval, social network analysis, and so on. The novelty of the proposed algorithm is that it can not only extract the pure text but also distinguish between the original post and replies. Experimental results on a large number of real Web forums indicate that the proposed algorithm can correctly ex
0 Replies

Loading