Closed itemsets are semantically equivalent to frequent itemsets but are orders of magnitude fewer, thus allowing the knowledge extracted from a transactional database to be represented very concisely. Unfortunately, no algorithm has been yet devised which allows to mine closed patterns directly. All existing algorithms may in fact generate the same closed itemset multiple times, and need to maintain the closed itemsets mined so far in the main memory in order to check if the current element has been already discovered or not. In this paper we propose a general technique for promptly detecting and discarding duplicate closed itemsets without the need of keeping in the main memory the whole set of closed patterns. Our approach can be exploited with substantial performance benefits by any depth first algorithms for mining frequent closed itemsets. As a case of study, we implemented our technique within a new mining algorithm which adopts a vertical bitmap representation of the dataset. The experimental evaluation demonstrates that our algorithm outperforms other state of the art algorithms such as CHARM, CLOSET+ and FPCLOSE.

Mining Frequent Closed Itemsets without Duplicates Generation

Lucchese C;Perego R
2004

Abstract

Closed itemsets are semantically equivalent to frequent itemsets but are orders of magnitude fewer, thus allowing the knowledge extracted from a transactional database to be represented very concisely. Unfortunately, no algorithm has been yet devised which allows to mine closed patterns directly. All existing algorithms may in fact generate the same closed itemset multiple times, and need to maintain the closed itemsets mined so far in the main memory in order to check if the current element has been already discovered or not. In this paper we propose a general technique for promptly detecting and discarding duplicate closed itemsets without the need of keeping in the main memory the whole set of closed patterns. Our approach can be exploited with substantial performance benefits by any depth first algorithms for mining frequent closed itemsets. As a case of study, we implemented our technique within a new mining algorithm which adopts a vertical bitmap representation of the dataset. The experimental evaluation demonstrates that our algorithm outperforms other state of the art algorithms such as CHARM, CLOSET+ and FPCLOSE.
2004
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Frequent closed itemsets
Association rules
File in questo prodotto:
File Dimensione Formato  
prod_160716-doc_125698.pdf

accesso aperto

Descrizione: Mining Frequent Closed Itemsets without Duplicates Generation
Dimensione 149.59 kB
Formato Adobe PDF
149.59 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/152886
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact