Abstract
XML-enabled association rule framework [FDWC03] extends the notion of associated items to XML fragments to present associations among trees rather than simple-structured items of atomic values. They are more flexible and powerful in representing both simple and complex structured association relationships inherent in XML data. Compared with traditional association mining in the well-structured world, mining from XML data, however, is confronted with more challenges due to the inherent flexibilities of XML in both structure and semantics. The primary challenges include 1) a more complicated hierarchical data structure; 2) an ordered data context; and 3) a much bigger data size. In order to make XML-enabled association rule mining truly practical and computationally tractable, in this study, we present a template model to help users specify the interesting XML-enabled associations to be mined. Techniques for template-guided mining of association rules from large XML data are also described in the paper. We demonstrate the effectiveness of these techniques through a set of experiments on both synthetic and real-life data.
Original language | Undefined |
---|---|
Pages | 66-88 |
Number of pages | 23 |
DOIs | |
Publication status | Published - 2004 |
Event | Third International Workshop on Knowledge Discovery in Inductive Databases (KDID 2004) - Pisa, Italy Duration: 20 Sept 2004 → 20 Sept 2004 |
Workshop
Workshop | Third International Workshop on Knowledge Discovery in Inductive Databases (KDID 2004) |
---|---|
Period | 20/09/04 → 20/09/04 |
Other | September 20, 2004 |
Keywords
- EWI-7234
- IR-63504
- DB-DM: DATA MINING