Our research challenge is to provide a mechanism for splitting into user task-based sessions a long-term log of queries submitted to a Web Search Engine (WSE). The hypothesis is that some query sessions entail the concept of user task. We present an approach that relies on a centroid-based and a density-based clustering algorithm, which consider queries inter-arrival times and use a novel distance function that takes care of query lexical content and exploits the collaborative knowledge collected by Wiktionary and Wikipedia.
Detecting task-based query sessions using collaborative knowledge
Lucchese C;Orlando S;Perego R;Silvestri F;
2010
Abstract
Our research challenge is to provide a mechanism for splitting into user task-based sessions a long-term log of queries submitted to a Web Search Engine (WSE). The hypothesis is that some query sessions entail the concept of user task. We present an approach that relies on a centroid-based and a density-based clustering algorithm, which consider queries inter-arrival times and use a novel distance function that takes care of query lexical content and exploits the collaborative knowledge collected by Wiktionary and Wikipedia.File in questo prodotto:
File | Dimensione | Formato | |
---|---|---|---|
prod_92074-doc_131481.pdf
solo utenti autorizzati
Descrizione: Software Services and Systems Network (S-Cube)
Tipologia:
Versione Editoriale (PDF)
Dimensione
250.34 kB
Formato
Adobe PDF
|
250.34 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.