In this work we propose a solution for the problem of the entities and relations extraction from textual documents to build an index for a semantically oriented search engine. The approach we propose is based on the integration of statistical classifiers and ontological constraints through Markov random fields. Owing to the high computational complexity of the approach, the architecture of our system is distributed and exploits parallelisation to lower processing time. In the experimental assessment we show how the proposed system can be effectively applied to a large data set, namely BioNLP-ST 2013. While the experimental results provided in the paper refer to a biomedical application, the approach is very general and can be ported to different domains.

A distributed architecture to integrate ontological nowledge into information extraction

Silvestri Stefano
2016

Abstract

In this work we propose a solution for the problem of the entities and relations extraction from textual documents to build an index for a semantically oriented search engine. The approach we propose is based on the integration of statistical classifiers and ontological constraints through Markov random fields. Owing to the high computational complexity of the approach, the architecture of our system is distributed and exploits parallelisation to lower processing time. In the experimental assessment we show how the proposed system can be effectively applied to a large data set, namely BioNLP-ST 2013. While the experimental results provided in the paper refer to a biomedical application, the approach is very general and can be ported to different domains.
2016
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
support vector machines
information extraction
graphical models
entity classification
relation extraction
relation classification
knowledge integration
ontological contraints
Markov random fields
distributed computing
File in questo prodotto:
File Dimensione Formato  
Pubblicazione3.pdf

solo utenti autorizzati

Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 927.4 kB
Formato Adobe PDF
927.4 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/339302
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 13
  • ???jsp.display-item.citation.isi??? ND
social impact