Abstract_Discovering frequent patterns in large datasets is one of the more pervasive data mining tasks. Albeit rooted in market basket analysis, frequent pattern mining can be adopted in many applications, and on data sources of different nature and structure; it also provides a basis for several other mining tasks, such as association rules, classification, and clustering. However, frequent pattern mining is inherently difficult, in that it handles typically too many input data, which typically yield too many patterns as a result - this is often an insuperable obstacle, both for performance limitations and for the impossibility to discern the interesting patterns from the many, mostly uninteresting, extracted ones. Preprocessing based on data reduction and userspecified constraints may be the solution to this problem: it may drive the mining process towards potentially interesting patterns, while enabling query optimizations at the same time. We show how this can be achieved on the basis of a simple yet powerful idea: combine constraints of different nature to the purpose of dramatically reducing the input database. The mining process after such preprocessing is strikingly optimized, both in terms of performance, and in capability of focussing on interesting patterns.

ExAnte: a preprocessing method for frequent-pattern mining

Bonchi F;Giannotti F;
2005

Abstract

Abstract_Discovering frequent patterns in large datasets is one of the more pervasive data mining tasks. Albeit rooted in market basket analysis, frequent pattern mining can be adopted in many applications, and on data sources of different nature and structure; it also provides a basis for several other mining tasks, such as association rules, classification, and clustering. However, frequent pattern mining is inherently difficult, in that it handles typically too many input data, which typically yield too many patterns as a result - this is often an insuperable obstacle, both for performance limitations and for the impossibility to discern the interesting patterns from the many, mostly uninteresting, extracted ones. Preprocessing based on data reduction and userspecified constraints may be the solution to this problem: it may drive the mining process towards potentially interesting patterns, while enabling query optimizations at the same time. We show how this can be achieved on the basis of a simple yet powerful idea: combine constraints of different nature to the purpose of dramatically reducing the input database. The mining process after such preprocessing is strikingly optimized, both in terms of performance, and in capability of focussing on interesting patterns.
2005
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Data Mining
Frequent pattern discovery
constraint pattern discovery
File in questo prodotto:
File Dimensione Formato  
prod_43820-doc_33192.pdf

solo utenti autorizzati

Descrizione: ExAnte: A preprocessing method for frequent-pattern mining
Tipologia: Versione Editoriale (PDF)
Dimensione 1.73 MB
Formato Adobe PDF
1.73 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/37387
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 12
  • ???jsp.display-item.citation.isi??? ND
social impact