We propose a hierarchical, model-based co-clustering framework for handling high-dimensional datasets. The technique views the dataset as a joint probability distribution over row and column variables. Our approach starts by initially clustering rows in a dataset, where each cluster is characterized by a different probability distribution. Subsequently, the conditional distribution of attributes over tuples is exploited to discover co-clusters underlying the data. An intensive empirical evaluation confirms the effectiveness of our approach, even when compared to well-known co-clustering schemes available from the current literature.

Hierarchical Model-Based Co-Clustering of High-Dimensional Data

Francesco Folino;Giuseppe Manco;Riccardo Ortale
2007

Abstract

We propose a hierarchical, model-based co-clustering framework for handling high-dimensional datasets. The technique views the dataset as a joint probability distribution over row and column variables. Our approach starts by initially clustering rows in a dataset, where each cluster is characterized by a different probability distribution. Subsequently, the conditional distribution of attributes over tuples is exploited to discover co-clusters underlying the data. An intensive empirical evaluation confirms the effectiveness of our approach, even when compared to well-known co-clustering schemes available from the current literature.
2007
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
978-88-902981-0-3
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/143078
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact