CNR Institutional Research Information System

Abstract_Discovering frequent patterns in large datasets is one of the more pervasive data mining tasks. Albeit rooted in market basket analysis, frequent pattern mining can be adopted in many applications, and on data sources of different nature and structure; it also provides a basis for several other mining tasks, such as association rules, classification, and clustering. However, frequent pattern mining is inherently difficult, in that it handles typically too many input data, which typically yield too many patterns as a result - this is often an insuperable obstacle, both for performance limitations and for the impossibility to discern the interesting patterns from the many, mostly uninteresting, extracted ones. Preprocessing based on data reduction and userspecified constraints may be the solution to this problem: it may drive the mining process towards potentially interesting patterns, while enabling query optimizations at the same time. We show how this can be achieved on the basis of a simple yet powerful idea: combine constraints of different nature to the purpose of dramatically reducing the input database. The mining process after such preprocessing is strikingly optimized, both in terms of performance, and in capability of focussing on interesting patterns.

ExAnte: a preprocessing method for frequent-pattern mining

Bonchi F;Giannotti F;Mazzanti A;Pedreschi D

2005

Abstract

Abstract_Discovering frequent patterns in large datasets is one of the more pervasive data mining tasks. Albeit rooted in market basket analysis, frequent pattern mining can be adopted in many applications, and on data sources of different nature and structure; it also provides a basis for several other mining tasks, such as association rules, classification, and clustering. However, frequent pattern mining is inherently difficult, in that it handles typically too many input data, which typically yield too many patterns as a result - this is often an insuperable obstacle, both for performance limitations and for the impossibility to discern the interesting patterns from the many, mostly uninteresting, extracted ones. Preprocessing based on data reduction and userspecified constraints may be the solution to this problem: it may drive the mining process towards potentially interesting patterns, while enabling query optimizations at the same time. We show how this can be achieved on the basis of a simple yet powerful idea: combine constraints of different nature to the purpose of dramatically reducing the input database. The mining process after such preprocessing is strikingly optimized, both in terms of performance, and in capability of focussing on interesting patterns.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2005
			
	Strutture organizzative
	
				Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
			
	Parole chiave
	
				Data Mining
Frequent pattern discovery
constraint pattern discovery
			
	Appare nelle tipologie:
	
				01.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
prod_43820-doc_33192.pdf solo utenti autorizzati Descrizione: ExAnte: A preprocessing method for frequent-pattern mining Tipologia: Versione Editoriale (PDF) Dimensione 1.73 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.73 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/37387

Citazioni

ND

12

10

social impact