In this paper, we report on our experience in building an experimental similarity search system on a test collection of more than 50 million images, to show the possibility to scale Content-based Image Retrieval (CBIR) systems towards the Web size. First, we had to tackle the non-trivial process of image crawling and descriptive feature extraction, performed by using the European EGEE computer GRID, building a test collection, the first of such scale, that will be opened to the research community for experiments and comparisons. Then, we had to develop indexing and searching mechanisms which can scale up to these volumes and answer similarity queries in real-time. The results of our experiments are very encouraging for future applications.

Crawling, indexing, and similarity searching images on the Web

Falchi F;Lucchese C;Perego R;Rabitti F;
2008

Abstract

In this paper, we report on our experience in building an experimental similarity search system on a test collection of more than 50 million images, to show the possibility to scale Content-based Image Retrieval (CBIR) systems towards the Web size. First, we had to tackle the non-trivial process of image crawling and descriptive feature extraction, performed by using the European EGEE computer GRID, building a test collection, the first of such scale, that will be opened to the research community for experiments and comparisons. Then, we had to develop indexing and searching mechanisms which can scale up to these volumes and answer similarity queries in real-time. The results of our experiments are very encouraging for future applications.
2008
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Salvatore Gaglio, Ignazio Infantino, Domenico Saccà
Sixteenth Italian Symposium on Advanced Database Systems
382
389
Sì, ma tipo non specificato
22-25 June 2008
Mondello (PA), Italy
Similarity search
Content-based image retrieval
Metric space
MPEG-7 descriptors
Peer-to-peer search network
4
open
Batko M.; Falchi F.; Lucchese C.; Novak D.; Perego R.; Rabitti F.; Sedmidubsky J.; Zezula P.
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
prod_91836-doc_128537.pdf

accesso aperto

Descrizione: Crawling, indexing, and similarity searching images on the Web
Tipologia: Versione Editoriale (PDF)
Dimensione 248.9 kB
Formato Adobe PDF
248.9 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/58497
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact