While there exists an increasingly large number of Linked Data, metadata about the content covered by individual datasets is sparse. In this paper, we introduce a processing pipeline to automatically assess, annotate and index available linked datasets. Given a minimal description of a dataset from the DataHub, the process produces a structured RDF-based description that includes information about its main topics. Additionally, the generated descriptions embed datasets into an interlinked graph of datasets based on shared topic vocabularies. We adopt and integrate techniques for Named Entity Recognition and auto- mated data validation, providing a consistent work ow for dataset profiling and annotation. Finally, we validate the results obtained with our tool.
Generating structured Profiles of Linked Data Graphs
2013
Abstract
While there exists an increasingly large number of Linked Data, metadata about the content covered by individual datasets is sparse. In this paper, we introduce a processing pipeline to automatically assess, annotate and index available linked datasets. Given a minimal description of a dataset from the DataHub, the process produces a structured RDF-based description that includes information about its main topics. Additionally, the generated descriptions embed datasets into an interlinked graph of datasets based on shared topic vocabularies. We adopt and integrate techniques for Named Entity Recognition and auto- mated data validation, providing a consistent work ow for dataset profiling and annotation. Finally, we validate the results obtained with our tool.| File | Dimensione | Formato | |
|---|---|---|---|
|
prod_273578-doc_76485.pdf
non disponibili
Descrizione: Generating structure Profiles of Linked Data Graphs
Tipologia:
Versione Editoriale (PDF)
Dimensione
259.35 kB
Formato
Adobe PDF
|
259.35 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


