In this paper we address the problem of arbitrarily shaped clustering of points belonging to a linear space. A powerful method for finding this kind of structure is the spectral clustering which often outperforms traditional clustering algorithms such as k-means. It acts through the eigenvectors of a matrix which describes the similarity between pairs of data points. Two important items contribute to the construction of this similarity matrix: the sparsity of the underlying weighted graph and the similarity function, in general a Gaussian which depends on a scale parameter ? to be selected. Both issues are critical and in this paper we propose a heuristic strategy which reduces these criticalities by performing first a partition, by means of a spectral clustering in a number of convex clusters k sufficiently greater than the target one, and then a merging technique which makes use only of the information already produced. A numerical experimentation on artificial dataset has been carried out to compare the performances of the spectral clustering with the proposed method.

A robust strategy based on spectral clustering and a merging technique for the non convex clustering

P Favati;
2021

Abstract

In this paper we address the problem of arbitrarily shaped clustering of points belonging to a linear space. A powerful method for finding this kind of structure is the spectral clustering which often outperforms traditional clustering algorithms such as k-means. It acts through the eigenvectors of a matrix which describes the similarity between pairs of data points. Two important items contribute to the construction of this similarity matrix: the sparsity of the underlying weighted graph and the similarity function, in general a Gaussian which depends on a scale parameter ? to be selected. Both issues are critical and in this paper we propose a heuristic strategy which reduces these criticalities by performing first a partition, by means of a spectral clustering in a number of convex clusters k sufficiently greater than the target one, and then a merging technique which makes use only of the information already produced. A numerical experimentation on artificial dataset has been carried out to compare the performances of the spectral clustering with the proposed method.
2021
Istituto di informatica e telematica - IIT
Similarity matrix
Multiclass spectral clustering
Hierarchical algorithm
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/401552
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact