The increasing complexity of the web arises the need of tools able to extract knowledge from this huge amount of information. The tool proposed in this work allows automatic extraction of knowledge from Reuters agency news stream. The approach maps relevant terms in a semantic space and selects from the corpus a set of triplets of words in a subject-verb-object (s-v-o) form. The mapping into a semantic space allows for the identification of meaningful relations among the words, giving also a measure of the strength of these relations.

Semantic Driven Triplet Extraction From Unstructured Text

Marilena Ditta;Fabrizio Milazzo;Agnese Augello;Giovanni Pilato
2015

Abstract

The increasing complexity of the web arises the need of tools able to extract knowledge from this huge amount of information. The tool proposed in this work allows automatic extraction of knowledge from Reuters agency news stream. The approach maps relevant terms in a semantic space and selects from the corpus a set of triplets of words in a subject-verb-object (s-v-o) form. The mapping into a semantic space allows for the identification of meaningful relations among the words, giving also a measure of the strength of these relations.
2015
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Semantic spaces
LSA
triplets extraction
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/309961
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact