Abstract - This paper describes the main characteristics of the ItalWordNet semantic database, built in the context of the SI-TAL Italian National Project, within which a set of integrated resources and tools for the automatic treatment of the Italian language was realized. The database was created by extending the Italian wordnet developed within the EuroWordNet project, by adding: i) adjectives, adverbs and proper nouns (not dealt with in EuroWordNet); ii) a terminological subset related to the economic-financial domain. The relevant changes involved by these extensions both in the linguistic model and in the data structure are also illustrated. In particular, we discuss: i) the overall architecture of the database; ii) the semantic relations used to encode information on synsets; iii) the changes made to the EuroWordNet Top Ontology structure; iv) the specific characteristics of the terminological subset and the solutions adopted to link it to the generic wordnet. Keywords - synset, semantic database, wordnet, semantic

ItalWordNet: building a large semantic database for the automatic treatment of Italian

Roventini A;Marinelli R;
2003

Abstract

Abstract - This paper describes the main characteristics of the ItalWordNet semantic database, built in the context of the SI-TAL Italian National Project, within which a set of integrated resources and tools for the automatic treatment of the Italian language was realized. The database was created by extending the Italian wordnet developed within the EuroWordNet project, by adding: i) adjectives, adverbs and proper nouns (not dealt with in EuroWordNet); ii) a terminological subset related to the economic-financial domain. The relevant changes involved by these extensions both in the linguistic model and in the data structure are also illustrated. In particular, we discuss: i) the overall architecture of the database; ii) the semantic relations used to encode information on synsets; iii) the changes made to the EuroWordNet Top Ontology structure; iv) the specific characteristics of the terminological subset and the solutions adopted to link it to the generic wordnet. Keywords - synset, semantic database, wordnet, semantic
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Roventini A it
dc.authority.people Alonge A it
dc.authority.people Bertagna F it
dc.authority.people Calzolari N it
dc.authority.people Cancila J it
dc.authority.people Girardi C it
dc.authority.people Magnini B it
dc.authority.people Marinelli R it
dc.authority.people Speranza M it
dc.authority.people Zampolli A it
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Articolo in rivista *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 00:46:04 -
dc.date.available 2024/02/19 00:46:04 -
dc.date.issued 2003 -
dc.description.abstract Abstract - This paper describes the main characteristics of the ItalWordNet semantic database, built in the context of the SI-TAL Italian National Project, within which a set of integrated resources and tools for the automatic treatment of the Italian language was realized. The database was created by extending the Italian wordnet developed within the EuroWordNet project, by adding: i) adjectives, adverbs and proper nouns (not dealt with in EuroWordNet); ii) a terminological subset related to the economic-financial domain. The relevant changes involved by these extensions both in the linguistic model and in the data structure are also illustrated. In particular, we discuss: i) the overall architecture of the database; ii) the semantic relations used to encode information on synsets; iii) the changes made to the EuroWordNet Top Ontology structure; iv) the specific characteristics of the terminological subset and the solutions adopted to link it to the generic wordnet. Keywords - synset, semantic database, wordnet, semantic -
dc.description.affiliations Alonge A: Università di Perugia; MagniniB.: IRST-Trento; Speranza M.: IRST-Trento; Cancila: IRST-Trento; Zampolli A. Direttore ILC Uni Pisa; Cancila J.: Girardi C.: -
dc.description.allpeople Roventini, A; Alonge, A; Bertagna, F; Calzolari, N; Cancila, J; Girardi, C; Magnini, B; Marinelli, R; Speranza, M; Zampolli, A -
dc.description.allpeopleoriginal Roventini A. , Alonge A. , Bertagna F. , Calzolari N. , Cancila J. , Girardi C. , Magnini B. , Marinelli R. 8, Speranza M. 9, Zampolli A. -
dc.description.fulltext none en
dc.description.note La risorsa IWN viene distribuita attraverso ELDA (ne sono state vendute diverse copie). IWN è compatibile con gli standard WordNet ed EWN ed è disponibile in formato XML. IWN è stata usata come risorsa lessicale di riferimento per la codifica semantica della ISST (Italian Syntactic Semantic Treebank) nel progetto TAL, per la seconda e terza edizione della competizione internazionale di sistemi di disambiguazione SENSEVAL e come base di conoscenza per un sistema di Question Answering per l’italiano sviluppato presso l'ILC. L'attività di ricerca legata allo sviluppo di IWN è accompagnata da una ricca produzione di pubblicazioni e presentata in numerosi congressi internazionali. Per la rilevanza del congresso nel panorama della disciplina e per la severità della selezione, ricordiamo l'articolo Alonge A., Bertagna F., Calzolari N., Roventini A., Zampolli A., Encoding information on adjectives in a lexical-semantic net for computational applications, in Proceedings NAACL 2001. -
dc.description.numberofauthors 10 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/37667 -
dc.relation.firstpage 745 -
dc.relation.lastpage 791 -
dc.relation.volume 18-19 -
dc.subject.keywords Database lessicale -
dc.subject.keywords Rete semantica -
dc.subject.keywords Relazioni semantiche -
dc.subject.keywords Risorse linguistiche -
dc.subject.singlekeyword Database lessicale *
dc.subject.singlekeyword Rete semantica *
dc.subject.singlekeyword Relazioni semantiche *
dc.subject.singlekeyword Risorse linguistiche *
dc.title ItalWordNet: building a large semantic database for the automatic treatment of Italian en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.miur 262 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 64479 -
iris.orcid.lastModifiedDate 2024/04/04 10:48:30 *
iris.orcid.lastModifiedMillisecond 1712220510551 *
iris.sitodocente.maxattempts 3 -
Appare nelle tipologie: 01.01 Articolo in rivista
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/37667
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact