Approximate similarity search in metric data by using region proximity

Amato G; Rabitti F; Savino P
2000

Abstract

The problem of approximated similarity search for range and nearest neighbor queries is investigated for generic metric spaces. The search speedup is achieved by ignoring data regions with a small, user-defined proximity with respect to the query. For zero proximity, exact similarity search is performed. The problem of proximity of metric regions is explained and a probabilistic approach is applied. Approximated algorithms use a small amount of auxiliary data that can easily be maintained in main memory. The idea is implemented in a metric tree environment and experimentally evaluated on real-life files using specific performance measures. Improvements of two orders of magnitude can be achieved for moderately approximated search results. It is also demonstrated that the precision of the data regions' proximity measure significantly influences approximated algorithms.
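The pruning idea described in the abstract can be illustrated with a minimal sketch. It assumes ball-shaped data regions held in a flat list rather than a metric tree, and it uses a hypothetical geometric heuristic, overlap_proximity, as a stand-in for the probabilistic proximity measure of the paper, which is not reproduced here. All names (BallRegion, overlap_proximity, approx_range_search, threshold) are illustrative assumptions, not taken from the paper.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Point = Sequence[float]


def euclidean(a: Point, b: Point) -> float:
    """Euclidean distance; any metric could be substituted."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


@dataclass
class BallRegion:
    """A data region: all stored objects lie within `radius` of `center`."""
    center: Point
    radius: float
    objects: List[Point]


def overlap_proximity(d: float, r1: float, r2: float) -> float:
    """Hypothetical stand-in for a proximity measure (NOT the paper's).

    Returns 0 for disjoint balls and grows toward 1 as the overlap deepens.
    """
    overlap = r1 + r2 - d
    if overlap <= 0:
        return 0.0
    denom = 2 * min(r1, r2)
    if denom == 0:
        return 1.0
    return min(1.0, overlap / denom)


def approx_range_search(
    regions: List[BallRegion],
    query: Point,
    query_radius: float,
    threshold: float = 0.0,
    dist: Callable[[Point, Point], float] = euclidean,
) -> Tuple[List[Point], int]:
    """Range search that skips regions with proximity below `threshold`.

    With threshold == 0 only disjoint regions are skipped, so the result
    is exact. Returns the matching objects and the number of regions read.
    """
    results: List[Point] = []
    visited = 0
    for region in regions:
        d = dist(query, region.center)
        if d > query_radius + region.radius:
            continue  # disjoint from the query ball: safe to skip
        if overlap_proximity(d, query_radius, region.radius) < threshold:
            continue  # approximate pruning: may lose qualifying objects
        visited += 1
        results.extend(o for o in region.objects if dist(query, o) <= query_radius)
    return results, visited
```

Raising threshold trades result completeness for fewer region accesses, which is the trade-off the abstract evaluates. In a metric tree the same test would be applied to each node's covering region before descending into it.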
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Similarity search
Information search and retrieval
Files in this record:
prod_253548-doc_142288.pdf (open access)
Description: Approximate similarity search in metric data by using region proximity
Type: Publisher's version (PDF)
Size: 466.62 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/184238