Semantic enrichment methods may be used to identify relevant entities in textual documents. These extracted entities are part of knowledge graphs and thus linked by semantic relationships. This work explores the idea of navigating the semantic relationships among extracted entities as a way to search a text corpus. A modular software system (including document management, semantic enrichment, data consolidation, and data integration) has been designed, to offer a visual user interface for such navigation on top of an arbitrary corpus of textual documents. The software, called arca, has been used in a real use case: to search in the book catalogue of a publishing house. The evaluation carried out with a set of potential users has shown so far the feasibility and effectiveness of the approach. Critical issues and potential limitations of the paradigm have also been found and are discussed.
Exploring a text corpus via a knowledge graph
Ceriani M.
Secondo
;
2021
Abstract
Semantic enrichment methods may be used to identify relevant entities in textual documents. These extracted entities are part of knowledge graphs and thus linked by semantic relationships. This work explores the idea of navigating the semantic relationships among extracted entities as a way to search a text corpus. A modular software system (including document management, semantic enrichment, data consolidation, and data integration) has been designed, to offer a visual user interface for such navigation on top of an arbitrary corpus of textual documents. The software, called arca, has been used in a real use case: to search in the book catalogue of a publishing house. The evaluation carried out with a set of potential users has shown so far the feasibility and effectiveness of the approach. Critical issues and potential limitations of the paradigm have also been found and are discussed.File | Dimensione | Formato | |
---|---|---|---|
paper8.pdf
accesso aperto
Descrizione: Exploring a Text Corpus via a Knowledge Graph
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
2.84 MB
Formato
Adobe PDF
|
2.84 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.