Interpretable Data Partitioning Through Tree-Based Clustering Methods Riccardo Guidotti, Cristiano Landi, Andrea Beretta, Daniele Fadda & Mirco Nanni Conference paper First Online: 08 October 2023 311 Accesses Part of the Lecture Notes in Computer Science book series (LNAI,volume 14276) The growing interpretable machine learning research field is mainly focusing on the explanation of supervised approaches. However, also unsupervised approaches might benefit from considering interpretability aspects. While existing clustering methods only provide the assignment of records to clusters without justifying the partitioning, we propose tree-based clustering methods that offer interpretable data partitioning through a shallow decision tree. These decision trees enable easy-to-understand explanations of cluster assignments through short and understandable split conditions. The proposed methods are evaluated through experiments on synthetic and real datasets and proved to be more effective than traditional clustering approaches and interpretable ones in terms of standard evaluation measures and runtime. Finally, a case study involving human participation demonstrates the effectiveness of the interpretable clustering trees returned by the proposed method.

Interpretable data partitioning through tree-based clustering methods

Guidotti R;Beretta A;Fadda D;Nanni M
2023

Abstract

Interpretable Data Partitioning Through Tree-Based Clustering Methods Riccardo Guidotti, Cristiano Landi, Andrea Beretta, Daniele Fadda & Mirco Nanni Conference paper First Online: 08 October 2023 311 Accesses Part of the Lecture Notes in Computer Science book series (LNAI,volume 14276) The growing interpretable machine learning research field is mainly focusing on the explanation of supervised approaches. However, also unsupervised approaches might benefit from considering interpretability aspects. While existing clustering methods only provide the assignment of records to clusters without justifying the partitioning, we propose tree-based clustering methods that offer interpretable data partitioning through a shallow decision tree. These decision trees enable easy-to-understand explanations of cluster assignments through short and understandable split conditions. The proposed methods are evaluated through experiments on synthetic and real datasets and proved to be more effective than traditional clustering approaches and interpretable ones in terms of standard evaluation measures and runtime. Finally, a case study involving human participation demonstrates the effectiveness of the interpretable clustering trees returned by the proposed method.
2023
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
978-3-031-45274-1
Interpretable clustering
Tree-based clustering
Interpretable data partitioning
Explainable unsupervised learning
File in questo prodotto:
File Dimensione Formato  
prod_490353-doc_204425.pdf

solo utenti autorizzati

Descrizione: Interpretable data partitioning through tree-based clustering methods
Tipologia: Versione Editoriale (PDF)
Dimensione 1.2 MB
Formato Adobe PDF
1.2 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/452223
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact