The task of entity retrieval becomes increasingly prevalent as more and more (semi-) structured information about objects is available on the Web in the form of documents embedding metadata (RDF, RDFa, Microformats and others). However, research and development in that direction is dependent on (1) the availability of a representative corpus of entities that are found on the Web, and (2) the availability of an entity-oriented search infrastructure for experimenting new retrieval model. In this paper, we introduce the Sindice-2011 data collection which is derived from the data collected by the Sindice semantic search engine. The data collection is especially designed for supporting research in the domain of web entity retrieval. We describe how the corpus is organised, discuss statistics of the data collection, and introduce a search infrastructure to foster research and development.

The Sindice-2011 dataset for entity-oriented search in the Web of data

2011

Abstract

The task of entity retrieval becomes increasingly prevalent as more and more (semi-) structured information about objects is available on the Web in the form of documents embedding metadata (RDF, RDFa, Microformats and others). However, research and development in that direction is dependent on (1) the availability of a representative corpus of entities that are found on the Web, and (2) the availability of an entity-oriented search infrastructure for experimenting new retrieval model. In this paper, we introduce the Sindice-2011 data collection which is derived from the data collected by the Sindice semantic search engine. The data collection is especially designed for supporting research in the domain of web entity retrieval. We describe how the corpus is organised, discuss statistics of the data collection, and introduce a search infrastructure to foster research and development.
2011
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Balog K., de Vries A.P., Serdyukov P., Wen, J-R.
1st International Workshop on Entity-Oriented Search, vol. 1
1st International Workshop on Entity-Oriented Search
26
32
978-94-6186-000-2
http://research.microsoft.com/en-us/um/beijing/events/eos2011/
TU Delft
Delft
PAESI BASSI
Sì, ma tipo non specificato
28 July 2011
Beijing
Entity search
Web of Data
Entity corpus
ID_PUMA: /cnr.isti/2011-A2-040. - Area di valutazione 01 - Scienze matematiche e informatiche
0
restricted
Stéphane, Campinas ; Ceccarelli, Diego ; Perry, Thomas ; Delbru, Renaud ; Tummarello, Giovanni ; Balog, Krisztian
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
prod_204419-doc_45591.pdf

solo utenti autorizzati

Descrizione: contributo
Tipologia: Versione Editoriale (PDF)
Dimensione 793.29 kB
Formato Adobe PDF
793.29 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/178941
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact