Entity linking deals with identifying entities from a knowledge base in a given piece of text and has become a fundamental building block for web search engines, enabling numerous downstream improvements from better document ranking to enhanced search results pages. A key problem in the context of web search queries is that this process needs to run under severe time constraints as it has to be performed before any actual retrieval takes place, typically within milliseconds. In this paper we propose a probabilistic model that lever-ages user-generated information on the web to link queries to entities in a knowledge base. There are three key ingredi-ents that make the algorithm fast and space-effcient. First, the linking process ignores any dependencies between the different entity candidates, which allows for a O (k 2 ) imple-mentation in the number of query terms. Second, we leverage hashing and compression techniques to reduce the memory footprint. Finally, to equip the algorithm with contextual knowledge without sacrificing speed, we factor the distance between distributional semantics of the query words and entities into the model. We show that our solution significantly outperforms several state-of-the-art baselines by more than 14% while being able to process queries in sub-millisecond times|at least two orders of magnitude faster than existing systems.

Fast and space-efficient entity linking in queries

2015

Abstract

Entity linking deals with identifying entities from a knowledge base in a given piece of text and has become a fundamental building block for web search engines, enabling numerous downstream improvements from better document ranking to enhanced search results pages. A key problem in the context of web search queries is that this process needs to run under severe time constraints as it has to be performed before any actual retrieval takes place, typically within milliseconds. In this paper we propose a probabilistic model that lever-ages user-generated information on the web to link queries to entities in a knowledge base. There are three key ingredi-ents that make the algorithm fast and space-effcient. First, the linking process ignores any dependencies between the different entity candidates, which allows for a O (k 2 ) imple-mentation in the number of query terms. Second, we leverage hashing and compression techniques to reduce the memory footprint. Finally, to equip the algorithm with contextual knowledge without sacrificing speed, we factor the distance between distributional semantics of the query words and entities into the model. We show that our solution significantly outperforms several state-of-the-art baselines by more than 14% while being able to process queries in sub-millisecond times|at least two orders of magnitude faster than existing systems.
2015
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
978-1-4503-3317-7
Entity linking
Queries
Web search
Wikipedia
File in questo prodotto:
File Dimensione Formato  
prod_347581-doc_109459.pdf

solo utenti autorizzati

Descrizione: Fast and Space-Efficient Entity Linking in Queries
Tipologia: Versione Editoriale (PDF)
Dimensione 980.82 kB
Formato Adobe PDF
980.82 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/316056
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 170
  • ???jsp.display-item.citation.isi??? 111
social impact