The problem of approximated similarity search for the range and nearest neighbor queries is investigated for generic metric spaces. The search speedup is achieved by ignoring data regions with a small, user dened, proximity with respect to the query. For zero proximity, exact similarity search is performed. The problem of proximity of metric regions is explained and a probabilistic approach is applied. Approximated algorithms use a small amount of auxiliary data that can easily be maintained in main memory. The idea is implemented in a metric tree environment and experimentally evaluated on real-life les using specic performance measures. Improvements of two orders of magnitude can be achieved for moderately approximated search results. It is also demon- strated that the precision of data regions' proximity measure signicantly influence approximated algorithms.

Approximate similarity search in metric data by using region proximity

Amato G;Rabitti F;Savino P;
2000

Abstract

The problem of approximated similarity search for the range and nearest neighbor queries is investigated for generic metric spaces. The search speedup is achieved by ignoring data regions with a small, user dened, proximity with respect to the query. For zero proximity, exact similarity search is performed. The problem of proximity of metric regions is explained and a probabilistic approach is applied. Approximated algorithms use a small amount of auxiliary data that can easily be maintained in main memory. The idea is implemented in a metric tree environment and experimentally evaluated on real-life les using specic performance measures. Improvements of two orders of magnitude can be achieved for moderately approximated search results. It is also demon- strated that the precision of data regions' proximity measure signicantly influence approximated algorithms.
2000
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Proc. of the First DELOS workshop on "Information Seeking, Searching and Querying in Digital Libraries"
First DELOS workshop on "Information Seeking, Searching and Querying in Digital Libraries"
101
106
7
http://www.ercim.eu/publication/ws-proceedings/DelNoe01/18_Amato.pdf
Sì, ma tipo non specificato
11-12 December 2000
Zurich, Switzerland
Similarity search
Information search and retrieval
Codice PuMa: cnr.iei/2000-A2-048
4
open
Amato, G; Rabitti, F; Savino, P; Zezula, P
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
prod_253548-doc_142288.pdf

accesso aperto

Descrizione: Approximate similarity search in metric data by using region proximity
Tipologia: Versione Editoriale (PDF)
Dimensione 466.62 kB
Formato Adobe PDF
466.62 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/184238
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact