This paper discusses the design and implementation of SDC, a new caching strategy aimed to e ciently exploit the locality present in the stream of queries submitted to a Web Search Engine. SDC stores the results of the most frequently submitted queries in a fixed-size read-only portion of the cache, while the queries that cannot be satis ed by the static portion compete for the remaining entries of the cache according to a given cache replacement policy. We experimentally demonstrated the superiority of SDC over purely static and dynamic policies by measuring the hit-ratio achieved on two large query logs by varying cache parameters and the replacement policy used. Finally, we propose an implementation optimized for concurrent accesses, and we accurately evaluate its scalability.

A highly scalable parallel caching system for web search engine results

Fagni T;Perego R;Silvestri F
2004

Abstract

This paper discusses the design and implementation of SDC, a new caching strategy aimed to e ciently exploit the locality present in the stream of queries submitted to a Web Search Engine. SDC stores the results of the most frequently submitted queries in a fixed-size read-only portion of the cache, while the queries that cannot be satis ed by the static portion compete for the remaining entries of the cache according to a given cache replacement policy. We experimentally demonstrated the superiority of SDC over purely static and dynamic policies by measuring the hit-ratio achieved on two large query logs by varying cache parameters and the replacement policy used. Finally, we propose an implementation optimized for concurrent accesses, and we accurately evaluate its scalability.
2004
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Marco Danelutto, Marco Vanneschi, Domenico Laforenza
Euro-Par 2004 Parallel Processing, 10th International Euro-Par Conference, (Pisa, Italy, 31 August - 3 September, 2004).
Euro-Par 2004 - Parallel Processing, 10th International Euro-Par Conference
347
354
8
978-3-540-22924-7
Springer-Verlag - Berlin Heidelberg New York
Berlin
GERMANIA
Sì, ma tipo non specificato
31 August - 3 September 2004
Pisa, Italy
Caching
Search engines
IF: 0,513
3
restricted
Fagni T.; Perego R.; Silvestri F.
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
prod_43727-doc_123129.pdf

solo utenti autorizzati

Descrizione: A highly scalable parallel caching system for web search engine results
Tipologia: Versione Editoriale (PDF)
Dimensione 151.13 kB
Formato Adobe PDF
151.13 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/36587
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact