Similarity join is an interesting complement of the well-established similarity range and nearest neighbors search primitives in metric spaces. However, the quadratic computational complexity of similarity join prevents from applications on large data collections. We present MCAN+, an extension of MCAN (a Content-Addressable Network for metric objects) to support similarity self join queries. The challenge of the proposed approach is to address the problem of the intrinsic quadratic complexity of similarity joins, with the aim of limiting the elaboration time, by involving an increasing number of computational nodes as the dataset size grows. To test the scalability of MCAN+, we used a real-life dataset of color features extracted from one million images of the Flickr photo sharing website.

A content-addressable network for similarity join in metric spaces

Gennaro C
2008

Abstract

Similarity join is an interesting complement of the well-established similarity range and nearest neighbors search primitives in metric spaces. However, the quadratic computational complexity of similarity join prevents from applications on large data collections. We present MCAN+, an extension of MCAN (a Content-Addressable Network for metric objects) to support similarity self join queries. The challenge of the proposed approach is to address the problem of the intrinsic quadratic complexity of similarity joins, with the aim of limiting the elaboration time, by involving an increasing number of computational nodes as the dataset size grows. To test the scalability of MCAN+, we used a real-life dataset of color features extracted from one million images of the Flickr photo sharing website.
2008
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Third Interational ICST Conference on Scalable Information Systems. Infoscale'08
5
978-963-9799-28-8
http://dl.acm.org/citation.cfm?id=1459709
ICST (Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering)
Brussels
BELGIO
Sì, ma tipo non specificato
4-6 June 2008
Vico Equense, Italy
Similarity Join
Content-Addressable Network
Metric Space
Articolo n. 11
1
restricted
Gennaro C.
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
prod_91808-doc_36881.pdf

solo utenti autorizzati

Descrizione: A content-addressable network for similarity join in metric spaces
Tipologia: Versione Editoriale (PDF)
Dimensione 440.79 kB
Formato Adobe PDF
440.79 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/58469
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact