Named Entity WordNet
Toral Ruiz A.; Muñoz R.; Monachini M.
2008
Abstract
This paper presents the automatic extension of Princeton WordNet with Named Entities (NEs). The new resource is called Named Entity WordNet. Our method maps the noun is-a hierarchy of WordNet to Wikipedia categories, identifies the NEs present in those categories, and extracts information about them such as written variants and definitions. This information is inserted into an NE repository, and a module converts this generic repository into the WordNet-specific format. The paper explores several aspects of the methodology: the treatment of polysemous terms, the identification of hyponyms within the Wikipedia categorization system, the identification of Wikipedia articles that are NEs, and the design of an NE repository compliant with the LMF ISO standard. So far, this procedure enriches WordNet with 310,742 NEs and 381,043 "instance of" relations.

| DC Field | Value | Language |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Toral Ruiz A | it |
| dc.authority.people | Muñoz R | it |
| dc.authority.people | Monachini M | it |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/19 19:46:24 | - |
| dc.date.available | 2024/02/19 19:46:24 | - |
| dc.date.issued | 2008 | - |
| dc.description.abstract | This paper presents the automatic extension of Princeton WordNet with Named Entities (NEs). This new resource is called Named Entity WordNet. Our method maps the noun is-a hierarchy of WordNet to Wikipedia categories, identifies the NEs present in the latter and extracts different information from them such as written variants, definitions, etc. This information is inserted into a NE repository. A module that converts from this generic repository to the WordNet specific format has been developed. The paper explores different aspects of our methodology such as the treatment of polysemous terms, the identification of hyponyms within the Wikipedia categorization system, the identification of Wikipedia articles which are NEs and the design of a NE repository compliant with the LMF ISO standard. So far, this procedure enriches WordNet with 310,742 NEs and 381,043 "instance of" relations. | - |
| dc.description.affiliations | Muñoz Rafael: University of Alicante, Spain. | - |
| dc.description.allpeople | Toral Ruiz A.; Muñoz R.; Monachini M. | - |
| dc.description.allpeopleoriginal | Toral Ruiz A.; Muñoz R.; Monachini M. | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 1 | - |
| dc.identifier.isbn | 2-9517408-4-0 | - |
| dc.identifier.isi | WOS:000324028900129 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/65096 | - |
| dc.language.iso | eng | - |
| dc.relation.conferencedate | 26-05/1-06-2008 | - |
| dc.relation.conferencename | LREC 2008, Sixth International Conference on Language Resources and Evaluation | - |
| dc.relation.conferenceplace | Marrakech, Morocco | - |
| dc.relation.firstpage | 741 | - |
| dc.relation.lastpage | 747 | - |
| dc.subject.keywords | Lexicon | - |
| dc.subject.keywords | Named Entity recognition | - |
| dc.subject.keywords | Ontologies | - |
| dc.subject.keywords | Lexical database | - |
| dc.subject.singlekeyword | Lexicon | * |
| dc.subject.singlekeyword | Named Entity recognition | * |
| dc.subject.singlekeyword | Ontologies | * |
| dc.subject.singlekeyword | Lexical database | * |
| dc.title | Named Entity WordNet | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.type.referee | Yes, but type not specified | - |
| dc.ugov.descaux1 | 84722 | - |
| iris.isi.extIssued | 2008 | - |
| iris.isi.extTitle | Named Entity WordNet | - |
| iris.orcid.lastModifiedDate | 2024/03/01 14:36:34 | * |
| iris.orcid.lastModifiedMillisecond | 1709300194451 | * |
| iris.sitodocente.maxattempts | 3 | - |
| isi.category | OT | * |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.affiliation | - | |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | - | |
| isi.contributor.country | Italy | - |
| isi.contributor.name | Antonio | - |
| isi.contributor.name | Rafael | - |
| isi.contributor.name | Monica | - |
| isi.contributor.researcherId | OQJ-6695-2025 | - |
| isi.contributor.researcherId | H-3101-2015 | - |
| isi.contributor.researcherId | F-3077-2015 | - |
| isi.contributor.subaffiliation | Ist Linguist Computaz | - |
| isi.contributor.subaffiliation | - | |
| isi.contributor.subaffiliation | Ist Linguist Computaz | - |
| isi.contributor.surname | Toral | - |
| isi.contributor.surname | Munoz | - |
| isi.contributor.surname | Monachini | - |
| isi.date.issued | 2008 | * |
| isi.description.allpeopleoriginal | Toral, A; Muñoz, R; Monachini, M; | * |
| isi.document.sourcetype | WOS.ISSHP | * |
| isi.document.type | Proceedings Paper | * |
| isi.document.types | Proceedings Paper | * |
| isi.identifier.isi | WOS:000324028900129 | * |
| isi.journal.journaltitle | SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008 | * |
| isi.language.original | English | * |
| isi.publisher.place | 55-57, RUE BRILLAT-SAVARIN, PARIS, 75013, FRANCE | * |
| isi.relation.firstpage | 741 | * |
| isi.relation.lastpage | 747 | * |
| isi.title | Named Entity WordNet | * |
| Appears in types: | 04.01 Contributo in Atti di convegno | |


