Clustering is the subset of data mining techniques used to agnostically classify entities by looking at their attributes. Clustering algorithms specialized to deal with complex networks are called community discovery. Notwithstanding their common objectives, there are crucial assumptions in community discovery edge sparsity and only one node type, among others which makes its mapping to clustering non trivial. In this paper, we propose a community discovery to clustering mapping, by focusing on transactional data clustering. We represent a network as a transactional dataset, and we find communities by grouping nodes with common items (neighbors) in their baskets (neighbor lists). By comparing our results with ground truth communities and state of the art community discovery methods, we show that transactional clustering algorithms are a feasible alternative to community discovery, and that a complete mapping of the two problems is possible.

On the Equivalence Between Community Discovery and Clustering

Guidotti R;
2018

Abstract

Clustering is the subset of data mining techniques used to agnostically classify entities by looking at their attributes. Clustering algorithms specialized to deal with complex networks are called community discovery. Notwithstanding their common objectives, there are crucial assumptions in community discovery edge sparsity and only one node type, among others which makes its mapping to clustering non trivial. In this paper, we propose a community discovery to clustering mapping, by focusing on transactional data clustering. We represent a network as a transactional dataset, and we find communities by grouping nodes with common items (neighbors) in their baskets (neighbor lists). By comparing our results with ground truth communities and state of the art community discovery methods, we show that transactional clustering algorithms are a feasible alternative to community discovery, and that a complete mapping of the two problems is possible.
2018
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Barbara Guidi, Laura Ricci, Carlos Calafate, Ombretta Gaggi, Johann Marquez-Barja
Smart Objects and Technologies for Social Good Third International Conference, GOODTECHS 2017, Pisa, Italy, November 29-30, 2017, Proceedings
3rd EAI International Conference on Smart Objects and Technologies for Social Good
342
352
978-3-319-76111-4
https://link.springer.com/chapter/10.1007/978-3-319-76111-4_34#citeas
Sì, ma tipo non specificato
29-30/11/2017
Pisa, Italy
Clustering
Community Discovery
Transactional Clustering
Problem Equivalence
1
open
Guidotti R.; Coscia M.
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
   SoBigData Research Infrastructure
   SoBigData
   H2020
   654024
File in questo prodotto:
File Dimensione Formato  
prod_384336-doc_131281.pdf

accesso aperto

Descrizione: goodcoclu2017guidotti
Tipologia: Versione Editoriale (PDF)
Dimensione 678.56 kB
Formato Adobe PDF
678.56 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/348373
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? ND
social impact