A genetic algorithm, that exploits the K-means principles for dividing objects in groups having high similarity, is proposed. The method evolves a population of chromosomes, each representing a division of objects in a different number of clusters. A group-based crossover, enriched with the one-step K-means operator, and a mutation strategy that reassigns objects to clusters on the base of their distance to the clusters computed so far, allow the approach to determine the best number of groups present in the dataset. The method has been experimented with four different fitness functions on both synthetic and real-world datasets, for which the ground-truth division is known, and compared with the K-means method. Results show that the approach obtains higher values of evaluation indexes than that obtained by the K-means method.

A K-means based genetic algorithm for data clustering

Pizzuti C;Procopio N
2016

Abstract

A genetic algorithm, that exploits the K-means principles for dividing objects in groups having high similarity, is proposed. The method evolves a population of chromosomes, each representing a division of objects in a different number of clusters. A group-based crossover, enriched with the one-step K-means operator, and a mutation strategy that reassigns objects to clusters on the base of their distance to the clusters computed so far, allow the approach to determine the best number of groups present in the dataset. The method has been experimented with four different fitness functions on both synthetic and real-world datasets, for which the ground-truth division is known, and compared with the K-means method. Results show that the approach obtains higher values of evaluation indexes than that obtained by the K-means method.
2016
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Inglese
11th International Conference on Soft Computing Models in Industrial and Environmental Applications
527
211
222
http://www.scopus.com/record/display.url?eid=2-s2.0-84992450242&origin=inward
Sì, ma tipo non specificato
19th - 21st October, 2016
SAN SEBASTIAN, SPAIN
Genetic algorithms
clustering
Kmeans
2
none
Pizzuti, C; Procopio, N
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/321527
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 14
  • ???jsp.display-item.citation.isi??? ND
social impact