The purpose of this work is to investigate the use of new swarm intelligence based techniques for data mining. According to this approach the data mining task is constructed as a set of biologically inspired agents. Each agent represents a simple task and the success of the method depends on the cooperative work of the agents. In this paper, we present a novel algorithm that uses techniques adapted from models originating from biological collective organisms to discover clusters of arbitrary shape, size and density in spatial data. The algorithm combines a smart exploratory strategy based on the movements of a flock of birds with a shared nearest neighbour clustering algorithm to discover clusters in parallel. In the algorithm, birds are used as agents with a exploring behaviour foraging for clusters. Moreover, this strategy can be used as a data reduction technique to perform efficiently approximate clustering. We have applied this algorithm on synthetic and real world datasets and we have measured, through computer simulation, the impact of the flocking search strategy on performance.

Discovering Clusters in Spatial Data using Swarm Intelligence

Folino Gianluigi;Forestiero Agostino;Spezzano Giandomenico
2003

Abstract

The purpose of this work is to investigate the use of new swarm intelligence based techniques for data mining. According to this approach the data mining task is constructed as a set of biologically inspired agents. Each agent represents a simple task and the success of the method depends on the cooperative work of the agents. In this paper, we present a novel algorithm that uses techniques adapted from models originating from biological collective organisms to discover clusters of arbitrary shape, size and density in spatial data. The algorithm combines a smart exploratory strategy based on the movements of a flock of birds with a shared nearest neighbour clustering algorithm to discover clusters in parallel. In the algorithm, birds are used as agents with a exploring behaviour foraging for clusters. Moreover, this strategy can be used as a data reduction technique to perform efficiently approximate clustering. We have applied this algorithm on synthetic and real world datasets and we have measured, through computer simulation, the impact of the flocking search strategy on performance.
2003
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/126525
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact