Abstract
Describe set-oriented algorithms for mining association rules. Such algorithms imply performing multiple joins and may appear to be inherently less efficient than special-purpose algorithms. We develop new algorithms that can be expressed as SQL queries, and discuss the optimization of these algorithms. After analytical evaluation, an algorithm named SETM emerges as the algorithm of choice. SETM uses only simple database primitives, viz. sorting and merge-scan join. SETM is simple, fast and stable over the range of parameter values. The major contribution of this paper is that it shows that at least some aspects of data mining can be carried out by using general query languages such as SQL, rather than by developing specialized black-box algorithms. The set-oriented nature of SETM facilitates the development of extensions
Original language | English |
---|---|
Title of host publication | Proceedings of the Eleventh International Conference on Data Engineering, ICDE 1995 |
Place of Publication | Los Alamitos, CA |
Publisher | IEEE |
Pages | 25-33 |
ISBN (Print) | 0-8186-69101 |
DOIs | |
Publication status | Published - 21 Feb 1995 |
Event | 11th International Conference on Data Engineering, ICDE 1995 - Taipei Duration: 6 Mar 1995 → 10 Mar 1995 Conference number: 11 |
Other
Other | 11th International Conference on Data Engineering, ICDE 1995 |
---|---|
Abbreviated title | ICDE |
City | Taipei |
Period | 6/03/95 → 10/03/95 |