Interpretable Data Partitioning Through Tree-Based Clustering Methods Riccardo Guidotti, Cristiano Landi, Andrea Beretta, Daniele Fadda & Mirco Nanni Conference paper First Online: 08 October 2023 311 Accesses Part of the Lecture Notes in Computer Science book series (LNAI,volume 14276) The growing interpretable machine learning research field is mainly focusing on the explanation of supervised approaches. However, also unsupervised approaches might benefit from considering interpretability aspects. While existing clustering methods only provide the assignment of records to clusters without justifying the partitioning, we propose tree-based clustering methods that offer interpretable data partitioning through a shallow decision tree. These decision trees enable easy-to-understand explanations of cluster assignments through short and understandable split conditions. The proposed methods are evaluated through experiments on synthetic and real datasets and proved to be more effective than traditional clustering approaches and interpretable ones in terms of standard evaluation measures and runtime. Finally, a case study involving human participation demonstrates the effectiveness of the interpretable clustering trees returned by the proposed method.

Interpretable data partitioning through tree-based clustering methods

Guidotti R;Beretta A;Fadda D;Nanni M
2023

Abstract

Interpretable Data Partitioning Through Tree-Based Clustering Methods Riccardo Guidotti, Cristiano Landi, Andrea Beretta, Daniele Fadda & Mirco Nanni Conference paper First Online: 08 October 2023 311 Accesses Part of the Lecture Notes in Computer Science book series (LNAI,volume 14276) The growing interpretable machine learning research field is mainly focusing on the explanation of supervised approaches. However, also unsupervised approaches might benefit from considering interpretability aspects. While existing clustering methods only provide the assignment of records to clusters without justifying the partitioning, we propose tree-based clustering methods that offer interpretable data partitioning through a shallow decision tree. These decision trees enable easy-to-understand explanations of cluster assignments through short and understandable split conditions. The proposed methods are evaluated through experiments on synthetic and real datasets and proved to be more effective than traditional clustering approaches and interpretable ones in terms of standard evaluation measures and runtime. Finally, a case study involving human participation demonstrates the effectiveness of the interpretable clustering trees returned by the proposed method.
2023
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Bifet A., Lorena A.C., Ribeiro R.P., Gama J., Abreu P.H.
Discovery Science
DS 2023 - 26th International Conference on Discovery Science
492
507
978-3-031-45274-1
https://link.springer.com/chapter/10.1007/978-3-031-45275-8_33
09-11/10/2023
Porto, Portugal
Interpretable clustering
Tree-based clustering
Interpretable data partitioning
Explainable unsupervised learning
4
restricted
Guidotti R.; Landi C.; Beretta A.; Fadda D.; Nanni M.
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
prod_490353-doc_204425.pdf

solo utenti autorizzati

Descrizione: Interpretable data partitioning through tree-based clustering methods
Tipologia: Versione Editoriale (PDF)
Dimensione 1.2 MB
Formato Adobe PDF
1.2 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/452223
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 2
social impact