This paper proposes CCSM (Cache-based Constrained Sequence Miner), a new level-wise algorithm that mines temporal databases to find sequential patterns satisfying user-defined constraints. The main innovation of CCSM is the adoption of k-way intersections of idlists to compute the support of candidate sequences. Our k-way intersection method is enhanced by the use of an e ective cache that stores intermediate idlists for future reuse. The exploitation of the cache entails a surprising reduction in the actual number of join operations performed on idlists. Moreover, CCSM is able to deal with very complex constraints, like the maximum temporal gap between events occurring in the input sequences. We experimentally evaluated the performances of CCSM on synthetically generated datasets, and compared them with those obtained running the cSPADE algorithm on the same datasets.

CCSM: an Efficient Algorithm for Constrained Sequence Mining

Orlando S;Perego R;
2003

Abstract

This paper proposes CCSM (Cache-based Constrained Sequence Miner), a new level-wise algorithm that mines temporal databases to find sequential patterns satisfying user-defined constraints. The main innovation of CCSM is the adoption of k-way intersections of idlists to compute the support of candidate sequences. Our k-way intersection method is enhanced by the use of an e ective cache that stores intermediate idlists for future reuse. The exploitation of the cache entails a surprising reduction in the actual number of join operations performed on idlists. Moreover, CCSM is able to deal with very complex constraints, like the maximum temporal gap between events occurring in the input sequences. We experimentally evaluated the performances of CCSM on synthetically generated datasets, and compared them with those obtained running the cSPADE algorithm on the same datasets.
2003
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Sequential pattern mining
File in questo prodotto:
File Dimensione Formato  
prod_120299-doc_123376.pdf

accesso aperto

Descrizione: CCSM: an Efficient Algorithm for Constrained Sequence Mining
Tipologia: Versione Editoriale (PDF)
Dimensione 267.79 kB
Formato Adobe PDF
267.79 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/101381
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact