Named Entity WordNet

Toral Ruiz A.; Muñoz R.; Monachini M.
2008

Abstract

This paper presents the automatic extension of Princeton WordNet with Named Entities (NEs). The resulting resource is called Named Entity WordNet. Our method maps the noun is-a hierarchy of WordNet to Wikipedia categories, identifies the NEs present in those categories, and extracts information about them, such as written variants and definitions. This information is stored in a generic NE repository, and a separate module converts the repository into the WordNet-specific format. The paper explores several aspects of the methodology: the treatment of polysemous terms, the identification of hyponyms within the Wikipedia categorization system, the identification of Wikipedia articles that are NEs, and the design of an NE repository compliant with the LMF ISO standard. So far, this procedure enriches WordNet with 310,742 NEs and 381,043 "instance of" relations.
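The relationship between a format-neutral NE repository and the "instance of" relations exported to WordNet can be illustrated with a minimal sketch. It is not the authors' implementation: the `NamedEntity` class, its field names, the `to_instance_relations` helper, the sample entries, and the synset identifiers are all hypothetical and chosen only for illustration. The sketch assumes each relation links one NE to one synset, which would explain why the abstract reports more relations than NEs: some entities attach to more than one synset.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class NamedEntity:
    """One entry in a hypothetical, format-neutral NE repository."""
    name: str                                             # canonical written form
    variants: List[str] = field(default_factory=list)     # other written variants
    definition: str = ""                                  # short gloss for the entity
    instance_of: List[str] = field(default_factory=list)  # illustrative synset ids


def to_instance_relations(entities: List[NamedEntity]) -> List[Tuple[str, str, str]]:
    """Flatten repository entries into (entity, "instance_of", synset) triples."""
    return [
        (ne.name, "instance_of", synset)
        for ne in entities
        for synset in ne.instance_of
    ]


# Illustrative data only; none of these entries come from the paper.
repository = [
    NamedEntity("Marrakech", ["Marrakesh"], "A city in Morocco.", ["city.n.01"]),
    NamedEntity("Nile", ["River Nile"], "A river in Africa.", ["river.n.01"]),
    NamedEntity(
        "Vatican City",
        ["Vatican"],
        "A city-state enclaved within Rome.",
        ["city.n.01", "country.n.02"],  # one entity, two "instance of" relations
    ),
]

relations = to_instance_relations(repository)
print(len(repository), "entities,", len(relations), "relations")
```

Keeping the repository separate from the export step mirrors the design described in the abstract, where a conversion module maps a generic repository to the WordNet-specific format.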
DC Field Value Language
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Toral Ruiz A it
dc.authority.people Muñoz R it
dc.authority.people Monachini M it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Conference proceedings contribution *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 19:46:24 -
dc.date.available 2024/02/19 19:46:24 -
dc.date.issued 2008 -
dc.description.affiliations Muñoz Rafael: University of Alicante, Spain. -
dc.description.allpeople Toral Ruiz A.; Muñoz R.; Monachini M. -
dc.description.allpeopleoriginal Toral Ruiz A.; Muñoz R.; Monachini M. -
dc.description.fulltext none en
dc.description.numberofauthors 1 -
dc.identifier.isbn 2-9517408-4-0 -
dc.identifier.isi WOS:000324028900129 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/65096 -
dc.language.iso eng -
dc.relation.conferencedate 26-05/1-06-2008 -
dc.relation.conferencename LREC 2008, Sixth International Conference on Language Resources and Evaluation -
dc.relation.conferenceplace Marrakech, Morocco -
dc.relation.firstpage 741 -
dc.relation.lastpage 747 -
dc.subject.keywords Lexicon -
dc.subject.keywords Named Entity recognition -
dc.subject.keywords Ontologies -
dc.subject.keywords Lexical database -
dc.subject.singlekeyword Lexicon *
dc.subject.singlekeyword Named Entity recognition *
dc.subject.singlekeyword Ontologies *
dc.subject.singlekeyword Lexical database *
dc.title Named Entity WordNet en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Conference contribution::04.01 Conference proceedings contribution it
dc.type.miur 273 -
dc.type.referee Yes, but type not specified -
dc.ugov.descaux1 84722 -
Appears in types: 04.01 Conference proceedings contribution
Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/65096