An Efficient Algorithm for Mining String Databases Under Constraints

2004 (modified: 02 Nov 2022), KDID 2004
Abstract: We study the problem of mining substring patterns from string databases. Patterns are selected using a conjunction of monotonic and anti-monotonic predicates. Building on the previously introduced version space tree data structure, we present a novel algorithm for discovering substring patterns. It requires only one database scan, which makes it highly scalable and applicable in distributed environments, where the data are not necessarily stored in local memory or on local disk. The algorithm is experimentally compared with a previously introduced algorithm in the same setting.
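
To make the constraint setting concrete, the following is a minimal brute-force sketch in Python. It is not the algorithm described in the abstract: it uses no version space tree and rescans the data for every candidate. It only illustrates what a conjunction of one anti-monotonic and one monotonic predicate selects. The choice of predicates (a minimum frequency on one database, a maximum frequency on another) and all names (`substrings`, `frequency`, `mine`, `pos`, `neg`, `min_pos`, `max_neg`) are assumptions made for illustration.

```python
def substrings(s):
    """Return the set of all non-empty substrings of s."""
    return {s[i:j] for i in range(len(s)) for j in range(i + 1, len(s) + 1)}


def frequency(pattern, database):
    """Count the strings in the database that contain pattern as a substring."""
    return sum(1 for s in database if pattern in s)


def mine(pos, neg, min_pos, max_neg):
    """Brute-force illustration, not the paper's method.

    Keeps every substring of the strings in `pos` that satisfies both:
      - frequency in `pos` >= min_pos  (anti-monotonic: if a pattern
        satisfies it, so does each of its substrings)
      - frequency in `neg` <= max_neg  (monotonic: if a pattern
        satisfies it, so does each of its superstrings)
    """
    candidates = set()
    for s in pos:
        candidates |= substrings(s)
    return sorted(
        p for p in candidates
        if frequency(p, pos) >= min_pos and frequency(p, neg) <= max_neg
    )


if __name__ == "__main__":
    # Hypothetical toy data, chosen only to exercise both predicates.
    pos = ["abcd", "abce", "xbcd"]
    neg = ["ab", "bc"]
    print(mine(pos, neg, min_pos=2, max_neg=0))
```

Because this sketch enumerates every substring and rescans both databases for each candidate, it is practical only for toy inputs. That cost is what a single-scan approach such as the one described in the abstract is meant to avoid.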